Event frame extraction based on a gene regulation corpus

Yutaka Sasaki, Paul Thompson, Philip Cotter, John McNaught, Sophia Ananiadou

    Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

    Abstract

    This paper describes the supervised acquisition of semantic event frames based on a corpus of biomedical abstracts, in which the biological process of E. coli gene regulation has been linguistically annotated by a group of biologists in the EC research project "BOOTStrep". Gene regulation is one of the rapidly advancing areas for which information extraction could boost research. Event frames are an essential linguistic resource for extraction of information from biological literature. This paper presents a specification for linguistic-level annotation of gene regulation events, followed by novel methods of automatic event frame extraction from text. The event frame extraction performance has been evaluated with 10 fold cross validation. The experimental results show that a precision of nearly 50% and a recall of around 20% are achieved. Since the goal of this paper is event frame extraction, rather than event instance extraction, the issue of low recall could be solved by applying the methods to a larger-scale corpus. © 2008. Licensed under the Creative Commons.
    Original languageEnglish
    Title of host publicationColing 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference|Coling - Int. Conf. Comput. Linguist., Proc. Conf.
    Pages761-768
    Number of pages7
    Volume1
    Publication statusPublished - 2008
    Event22nd International Conference on Computational Linguistics, Coling 2008 - Manchester
    Duration: 1 Jul 2008 → …
    http://www.aclweb.org/anthology/C08-1035

    Conference

    Conference22nd International Conference on Computational Linguistics, Coling 2008
    CityManchester
    Period1/07/08 → …
    Internet address

    Fingerprint

    Dive into the research topics of 'Event frame extraction based on a gene regulation corpus'. Together they form a unique fingerprint.

    Cite this