MURAL - Maynooth University Research Archive Library



    Developing Corpora for Statistical Graphical Language Models


    O'Sullivan, Andrew, Keyes, Laura and Winstanley, Adam C. (2006) Developing Corpora for Statistical Graphical Language Models. In: International Conference on Image Processing, Computer Vision, and Pattern Recognition (IPCV'06), 26-29 June 2006, Las Vegas.


    Abstract

    In this work, Statistical Graphical Language Models (SGLMs), a technique adapted from Statistical Language Models (SLMs), are applied to the task of graphical object recognition. SLMs are used in Natural Language Processing for tasks such as speech recognition and information retrieval. SGLMs treat graphical objects as belonging to graphical languages and use this view to compute probability distributions of graphical objects within graphical documents. SGLMs such as N-grams require large corpora of training data consisting of graphical objects in contextual use (real-world graphical documents). Constructing corpora is an important stage in developing the models, and many issues need to be addressed. This paper discusses the development of graphical corpora and presents approaches to some of the problems encountered.
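
    To make the N-gram idea in the abstract concrete, the following is a minimal sketch of a bigram model over sequences of graphical object labels. It is not the authors' implementation: the function names, the "<s>" start marker, the add-one smoothing, and the example labels ("building", "road") are all assumptions chosen for illustration. A corpus is assumed to be a list of documents, each an ordered list of object class labels.

```python
from collections import Counter


def train_bigram(corpus):
    """Count contexts and bigrams over sequences of object labels.

    corpus: iterable of documents, each an ordered list of labels
    (hypothetical format, assumed for this sketch).
    """
    contexts = Counter()  # how often each label occurs as a left context
    bigrams = Counter()   # how often each (previous, current) pair occurs
    vocab = set()
    for doc in corpus:
        seq = ["<s>"] + list(doc)  # "<s>" marks the start of a document
        vocab.update(seq)
        for prev, cur in zip(seq, seq[1:]):
            contexts[prev] += 1
            bigrams[(prev, cur)] += 1
    return contexts, bigrams, vocab


def bigram_prob(prev, cur, contexts, bigrams, vocab):
    """Add-one (Laplace) smoothed estimate of P(cur | prev)."""
    return (bigrams[(prev, cur)] + 1) / (contexts[prev] + len(vocab))


# Toy corpus of two "documents" with hypothetical object labels.
corpus = [
    ["building", "road", "building"],
    ["road", "building", "road"],
]
contexts, bigrams, vocab = train_bigram(corpus)
print(bigram_prob("building", "road", contexts, bigrams, vocab))  # 0.6
```

    With only two short documents, smoothing dominates the estimates, which illustrates why such models need large corpora: unseen label pairs receive probability mass only from the smoothing term until enough contextual examples have been collected.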
    Item Type: Conference or Workshop Item (Paper)
    Keywords: Graphics Recognition; Statistical Language Modelling; Corpora;
    Academic Unit: Faculty of Science and Engineering > Computer Science
    Item ID: 4915
    Depositing User: Dr. Adam Winstanley
    Date Deposited: 25 Apr 2014 15:38
    Refereed: No
    URI: https://mural.maynoothuniversity.ie/id/eprint/4915
    Use Licence: This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA).
