Akhtar, Shazia and Reilly, Ronan and Dunnion, John
(2001)
Automating XML mark-up using a two stage
machine learning technique.
Other.
UNSPECIFIED.
(Unpublished)
Abstract
We introduce a novel two-stage automatic XML mark-up system, which combines the
WEBSOM approach to document categorisation in conjunction with the C5 inductive learning
algorithm. The WEBSOM method clusters the XML marked-up documents such that semantically
similar documents lie close together on a Self-Organising Map (SOM). The C5 algorithm
automatically learns and applies mark-up rules derived from the nearest SOM neighbours of an
unmarked document. The system learns from mark-up errors to improve accuracy. The
automatically marked-up documents produced by the system are also categorized on the Self-Organizing Map, to further refine SOM's document coverage.
Item Type: |
Monograph
(Other)
|
Keywords: |
automatic mark-up; C5/See5; machine learning; Self-organizing Map; WEBSOM; XML; |
Academic Unit: |
Faculty of Science and Engineering > Computer Science |
Item ID: |
10454 |
Depositing User: |
Prof. Ronan Reilly
|
Date Deposited: |
23 Jan 2019 17:34 |
URI: |
|
Use Licence: |
This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available
here |
Repository Staff Only(login required)
|
Item control page |
Downloads per month over past year
Origin of downloads