Akhtar, Shazia and Reilly, Ronan and Dunnion, John
(2002)
AutoMarkup: A Tool for Automatically Marking up Text Documents.
In:
CICLing 2002: Computational Linguistics and Intelligent Text Processing.
Lecture Notes in Computer Science (LNCS)
(2276).
Springer, pp. 433-435.
ISBN 3540432191
Abstract
In this paper we present a novel system that can automatically mark up text documents into XML. The system uses the Self-Organizing Map (SOM) algorithm to organize marked documents on a map so that similar documents are placed on nearby locations. Then by using the inductive learning algorithm C5, it automatically generates and applies the markup rules from the nearest SOM neighbours of an unmarked document. The system is adaptive in nature and learns from errors in the automatically marked-up document to improve accuracy. The automatically marked-up documents are again arranged on the SOM.
Item Type: |
Book Section
|
Additional Information: |
Cite this paper as: Akhtar S., Reilly R.G., Dunnion J. (2002) AutoMarkup: A Tool for Automatically Marking up Text Documents. In: Gelbukh A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2002. Lecture Notes in Computer Science, vol 2276. Springer, Berlin, Heidelberg |
Keywords: |
AutoMarkup; Automatic; Mark up; Text Documents; XML; Self-Organizing Map (SOM); algorithm; |
Academic Unit: |
Faculty of Science and Engineering > Computer Science |
Item ID: |
8215 |
Identification Number: |
https://doi.org/10.1007/3-540-45715-1_46 |
Depositing User: |
Prof. Ronan Reilly
|
Date Deposited: |
15 May 2017 14:42 |
Publisher: |
Springer |
Refereed: |
Yes |
URI: |
|
Use Licence: |
This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available
here |
Repository Staff Only(login required)
|
Item control page |
Downloads per month over past year
Origin of downloads