MURAL - Maynooth University Research Archive Library



    Automating XML mark-up using a two stage machine learning technique


    Akhtar, Shazia and Reilly, Ronan and Dunnion, John (2001) Automating XML mark-up using a two stage machine learning technique. Other. UNSPECIFIED. (Unpublished)





    Abstract

    We introduce a novel two-stage automatic XML mark-up system that combines the WEBSOM approach to document categorisation with the C5 inductive learning algorithm. The WEBSOM method clusters XML marked-up documents so that semantically similar documents lie close together on a Self-Organising Map (SOM). The C5 algorithm then learns mark-up rules from the nearest SOM neighbours of an unmarked document and applies them to that document. The system learns from its mark-up errors to improve accuracy. The automatically marked-up documents it produces are also categorised on the Self-Organising Map, further refining the SOM's document coverage.
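The two-stage idea in the abstract can be illustrated with a small sketch. This is not the authors' system: cosine similarity over word counts stands in for WEBSOM's SOM neighbourhood, and a single-feature rule learner (in the style of OneR) stands in for C5. The segment features, tag names, and all function names below are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter
from xml.sax.saxutils import escape


def bag_of_words(text):
    """Word-count vector for a text (lower-cased alphanumeric tokens)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def nearest_documents(unmarked_text, marked_docs, k=2):
    """Stage 1 (stand-in for SOM neighbours): the k most similar marked-up docs.

    Each marked-up doc is a list of (segment, tag) pairs.
    """
    query = bag_of_words(unmarked_text)

    def similarity(doc):
        return cosine(query, bag_of_words(" ".join(seg for seg, _ in doc)))

    return sorted(marked_docs, key=similarity, reverse=True)[:k]


def features(segment):
    """Hypothetical surface features of a text segment."""
    return {
        "ends_with_period": segment.rstrip().endswith("."),
        "has_digit": any(c.isdigit() for c in segment),
        "short": len(segment.split()) <= 4,
    }


def learn_one_rule(neighbours):
    """Stage 2 (stand-in for C5): pick the single feature whose
    value -> majority-tag mapping makes the fewest training errors."""
    examples = [(features(seg), tag) for doc in neighbours for seg, tag in doc]
    best = None
    for name in sorted(examples[0][0]):
        by_value = {}
        for feats, tag in examples:
            by_value.setdefault(feats[name], Counter())[tag] += 1
        rule = {v: c.most_common(1)[0][0] for v, c in by_value.items()}
        errors = sum(1 for feats, tag in examples if rule[feats[name]] != tag)
        if best is None or errors < best[0]:
            best = (errors, name, rule)
    default = Counter(tag for _, tag in examples).most_common(1)[0][0]
    return best[1], best[2], default


def mark_up(unmarked_segments, marked_docs, k=2):
    """Tag each segment of an unmarked document using rules from its neighbours."""
    neighbours = nearest_documents(" ".join(unmarked_segments), marked_docs, k)
    name, rule, default = learn_one_rule(neighbours)
    return [(seg, rule.get(features(seg)[name], default)) for seg in unmarked_segments]


def to_xml(tagged_segments, root="doc"):
    """Serialise (segment, tag) pairs as a flat XML string."""
    body = "".join(f"<{tag}>{escape(seg)}</{tag}>" for seg, tag in tagged_segments)
    return f"<{root}>{body}</{root}>"


# Toy corpus of already marked-up documents (invented for this sketch).
doc1 = [("Neural networks", "title"),
        ("Neural networks learn weights from data.", "para")]
doc2 = [("Decision trees", "title"),
        ("Decision trees split data on attribute values.", "para")]
doc3 = [("Apple pie", "title"),
        ("Bake the apple pie for 40 minutes.", "para")]

unmarked = ["Learning from data", "Networks and trees both learn from data."]
tagged = mark_up(unmarked, [doc1, doc2, doc3])
print(to_xml(tagged))
```

In the system the abstract describes, the newly marked-up document would then be placed on the map itself, which in this sketch would amount to appending `tagged` to the list of marked-up documents.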

    Item Type: Monograph (Other)
    Keywords: automatic mark-up; C5/See5; machine learning; Self-organizing Map; WEBSOM; XML
    Academic Unit: Faculty of Science and Engineering > Computer Science
    Item ID: 10454
    Depositing User: Prof. Ronan Reilly
    Date Deposited: 23 Jan 2019 17:34
    URI:
