MURAL - Maynooth University Research Archive Library



    Automating XML markup of text documents


    Akhtar, Shazia, Reilly, Ronan and Dunnion, John (2001) Automating XML markup of text documents. Other. UNSPECIFIED. (Unpublished)

    [thumbnail of RR-Automating-2001.pdf]
    Preview
    Text
    RR-Automating-2001.pdf

    Download (120kB) | Preview

    Abstract

    We present a novel system for automatically marking up text documents into XML and discuss the benefits of XML markup for intelligent information retrieval. The system uses the Self-Organizing Map (SOM) algorithm to arrange XML marked-up documents on a two dimensional map so that similar documents appear closer to each other. It then employs an inductive learning algorithm C5 to automatically extract and apply markup rules from the nearest SOM neighbours of an unmarked document. The system is designed to be adaptive, so that once a document is marked-up; its behaviour is modified to improve accuracy. The automatically marked-up documents are again categorized on the Self-Organizing Map.
    Item Type: Monograph (Other)
    Keywords: Automatic XML markup; text documents; Self-Organizing Map (SOM) algorithm;
    Academic Unit: Faculty of Science and Engineering > Computer Science
    Item ID: 10448
    Depositing User: Prof. Ronan Reilly
    Date Deposited: 23 Jan 2019 16:01
    Refereed: Yes
    URI: https://mural.maynoothuniversity.ie/id/eprint/10448
    Use Licence: This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available here

    Repository Staff Only (login required)

    Item control page
    Item control page

    Downloads

    Downloads per month over past year

    Origin of downloads