The Hybridisation of a Data-driven Parser for Natural Languages

Sardar Jaf, Allan Ramsay

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Abstract

    Identifying and establishing structural relations between words in natural language sentences is called parsing. Ambiguities in natural languages make parsing a difficult task. It is even more difficult for a structurally complex language such as Arabic, which has a number of properties that make it particularly hard to handle. In this paper, we briefly highlight some of the complex structures of Arabic, identify different parsing approaches (grammar-driven and data-driven), and briefly discuss their limitations. Our main goal is to produce a hybrid parser that combines different parsing approaches: it retains the advantages of data-driven approaches but is guided by a set of grammatical rules to produce more accurate results. We describe a novel technique for directly combining different parsing approaches. We also present the results of our initial experiments and our plans for future work.
    Original language: English
    Title of host publication: 6th International Conference on Language and Technology
    Pages: 531-535
    Number of pages: 5
    Publication status: Published - 2013
    Event: 6th International Conference on Language and Technology - Polan
    Duration: 1 Jan 1824 → …

    Conference

    Conference: 6th International Conference on Language and Technology
    City: Polan
    Period: 1/01/24 → …

    Keywords

    • parsing
    • hybrid parsing
    • data-driven parsing
    • grammar-driven parsing
    • Arabic parsing
    • NLP parsing
