Anatomical Entity Mention Recognition at Literature Scale

S Pyysalo, S Ananiadou

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Motivation: Anatomical entities ranging from subcellular structures to organ systems are central to biomedical science, and mentions of these entities are essential to understanding the scientific literature. Despite extensive efforts to automatically analyze various aspects of biomedical text, there have been only few studies focusing on anatomical entities, and no dedicated methods for learning to automatically recognize anatomical entity mentions in free-form text have been introduced.
    Results: We present AnatomyTagger, a machine learning-based system for anatomical entity mention recognition. The system incorporates a broad array of approaches proposed to benefit tagging, including the use of Unified Medical Language System (UMLS)- and Open Biomedical Ontologies (OBO)-based lexical resources, word representations induced from unlabeled text, statistical truecasing and non-local features. We train and evaluate the system on a newly introduced corpus that substantially extends on previously available resources, and apply the resulting tagger to automatically annotate the entire open access scientific domain literature. The resulting analyses have been applied to extend services provided by the Europe PubMed Central literature database.
    Original languageEnglish
    Pages (from-to)868-875
    Number of pages8
    JournalBioinformatics
    Volume30
    Issue number6
    DOIs
    Publication statusPublished - 15 Mar 2014

    Fingerprint

    Dive into the research topics of 'Anatomical Entity Mention Recognition at Literature Scale'. Together they form a unique fingerprint.

    Cite this