MURAL - Maynooth University Research Archive Library



    Automating XML mark-up using a two stage machine learning technique


    Akhtar, Shazia, Reilly, Ronan and Dunnion, John (2001) Automating XML mark-up using a two stage machine learning technique. Other. UNSPECIFIED. (Unpublished)


    Abstract

    We introduce a novel two-stage automatic XML mark-up system that combines the WEBSOM approach to document categorisation with the C5 inductive learning algorithm. The WEBSOM method clusters XML marked-up documents so that semantically similar documents lie close together on a Self-Organising Map (SOM). The C5 algorithm automatically learns mark-up rules from the nearest SOM neighbours of an unmarked document and applies them to that document. The system learns from mark-up errors to improve accuracy. The automatically marked-up documents produced by the system are also categorised on the SOM, further refining the SOM's document coverage.
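
    The two stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on loud assumptions: a tiny one-dimensional SOM over bag-of-words vectors stands in for WEBSOM, and a majority vote per line position stands in for the C5 rule learner. Every name here (`train_som`, `nearest_marked`, `learn_rules`, `mark_up`, `auto_mark_up`, and the toy corpus `MARKED`) is hypothetical.

```python
import math
from collections import Counter
from xml.sax.saxutils import escape

def vectorise(text, vocab):
    """Bag-of-words count vector over a fixed vocabulary."""
    counts = Counter(text.lower().split())
    return [float(counts[w]) for w in vocab]

def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def best_unit(units, v):
    """Index of the map unit whose weights are closest to v."""
    return min(range(len(units)), key=lambda i: dist(units[i], v))

def train_som(vectors, n_units=2, epochs=20):
    """Stage 1 (assumed stand-in for WEBSOM): train a tiny 1-D SOM."""
    step = max(1, len(vectors) // n_units)
    # Seed the units from evenly spaced training vectors (deterministic).
    units = [list(vectors[min(i * step, len(vectors) - 1)])
             for i in range(n_units)]
    for epoch in range(epochs):
        lr = 0.5 * (1 - epoch / epochs)           # decaying learning rate
        radius = 1 if epoch < epochs // 2 else 0  # shrinking neighbourhood
        for v in vectors:
            bmu = best_unit(units, v)
            lo, hi = max(0, bmu - radius), min(n_units, bmu + radius + 1)
            for i in range(lo, hi):
                units[i] = [w + lr * (x - w) for w, x in zip(units[i], v)]
    return units

def doc_text(doc):
    """Join the line texts of a marked-up document."""
    return " ".join(text for text, _ in doc)

def nearest_marked(units, query_vec, marked, vocab, k=2):
    """Pick the k marked-up documents lying closest to the query on the map."""
    q = best_unit(units, query_vec)
    def key(doc):
        v = vectorise(doc_text(doc), vocab)
        # Map distance first, vector distance as a tie-breaker.
        return (abs(best_unit(units, v) - q), dist(v, query_vec))
    return sorted(marked, key=key)[:k]

def learn_rules(neighbours):
    """Stage 2 (assumed stand-in for C5): majority tag per line position."""
    votes = {}
    for doc in neighbours:
        for pos, (_, tag) in enumerate(doc):
            votes.setdefault(pos, Counter())[tag] += 1
    return {pos: c.most_common(1)[0][0] for pos, c in votes.items()}

def mark_up(lines, rules, default="text"):
    """Wrap each line in the tag that its position rule predicts."""
    out = []
    for i, line in enumerate(lines):
        tag = rules.get(i, default)
        out.append(f"<{tag}>{escape(line)}</{tag}>")
    return out

# Hypothetical toy corpus: each document is a list of (line, tag) pairs.
MARKED = [
    [("dear anna", "salutation"), ("thank you for the letter", "body"),
     ("yours john", "closing")],
    [("dear tom", "salutation"), ("thank you for the visit", "body"),
     ("yours mary", "closing")],
    [("tomato soup", "title"), ("tomato onion salt", "ingredients"),
     ("boil and stir", "method")],
    [("onion tart", "title"), ("onion flour butter", "ingredients"),
     ("bake and serve", "method")],
]

def auto_mark_up(lines, marked, k=2):
    """Run both stages on one unmarked document."""
    vocab = sorted({w for doc in marked for w in doc_text(doc).lower().split()})
    units = train_som([vectorise(doc_text(d), vocab) for d in marked])
    query = vectorise(" ".join(lines), vocab)
    rules = learn_rules(nearest_marked(units, query, marked, vocab, k))
    return mark_up(lines, rules)

if __name__ == "__main__":
    for line in auto_mark_up(["dear eve", "thank you for the soup",
                              "yours bob"], MARKED):
        print(line)
```

    In this sketch the rules are relearned for each unmarked document from its map neighbours only, which mirrors the abstract's description of deriving rules from the nearest SOM neighbours. The feedback steps (learning from mark-up errors and adding newly marked-up documents back to the map) are left out.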
    Item Type: Monograph (Other)
    Keywords: automatic mark-up; C5/See5; machine learning; Self-organizing Map; WEBSOM; XML;
    Academic Unit: Faculty of Science and Engineering > Computer Science
    Item ID: 10454
    Depositing User: Prof. Ronan Reilly
    Date Deposited: 23 Jan 2019 17:34
    URI: https://mural.maynoothuniversity.ie/id/eprint/10454
    Use Licence: This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA).
