PathNER: A tool for systematic identification of biological pathway mentions in the literature

    Research output: Contribution to journal › Article › peer-review

    Abstract

    BACKGROUND: Biological pathways are central to many biomedical studies and are frequently discussed in the literature. Several curated databases have been established to collate the knowledge of molecular processes constituting pathways. Yet, there has been little focus on enabling systematic detection of pathway mentions in the literature.

    RESULTS: We developed a tool, named PathNER (Pathway Named Entity Recognition), for the systematic identification of pathway mentions in the literature. PathNER is based on soft dictionary matching and rules, with the dictionary generated from public pathway databases. The rules utilise general pathway-specific keywords, syntactic information and gene/protein mentions. Detection results from both components are merged. On a gold-standard corpus, PathNER achieved an F1-score of 84%. To illustrate its potential, we applied PathNER to a collection of articles related to Alzheimer's disease to identify associated pathways, highlighting cases that can complement an existing manually curated knowledgebase.

    CONCLUSIONS: In contrast to existing text-mining efforts that target the automatic reconstruction of pathway details from molecular interactions mentioned in the literature, PathNER focuses on identifying specific named pathway mentions. These mentions can be used to support large-scale curation and pathway-related systems biology applications, as demonstrated in the example of Alzheimer's disease. PathNER is implemented in Java and made freely available online at http://sourceforge.net/projects/pathner/.
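The two-component design described in the abstract (a soft dictionary matcher and keyword rules whose detections are merged) can be illustrated with a small sketch. This is not PathNER's code, which is written in Java. Everything below is an assumption made for illustration: the three-entry dictionary, the normalisation used as a crude stand-in for soft matching, the single keyword rule, and all function names.

```python
import re

# Hypothetical mini-dictionary; the real tool derives its dictionary
# from public pathway databases.
DICTIONARY = [
    "Wnt signaling pathway",
    "mTOR signaling pathway",
    "Notch signaling pathway",
]

TOKEN = re.compile(r"\w[\w/-]*")
MAX_TOKENS = 3  # longest token window compared against the dictionary

# Hypothetical keyword rule: a capitalised, gene-like token followed by
# an optional "signaling"/"signalling" and a pathway keyword.
RULE = re.compile(
    r"\b[A-Z][A-Za-z0-9-]*\s+(?:(?:signaling|signalling)\s+)?"
    r"(?:pathways?|cascades?)\b"
)


def normalize(s):
    """Fold case, spelling, hyphenation and plural so that surface
    variants of one pathway name compare equal."""
    s = s.lower().replace("signalling", "signaling")
    s = re.sub(r"[^a-z0-9]+", " ", s).strip()
    return re.sub(r"pathways$", "pathway", s)


NORMALIZED = {normalize(entry) for entry in DICTIONARY}


def soft_dictionary_matches(text):
    """Return character spans whose normalised form is in the dictionary."""
    tokens = [m.span() for m in TOKEN.finditer(text)]
    spans = []
    for i in range(len(tokens)):
        for j in range(i, min(i + MAX_TOKENS, len(tokens))):
            start, end = tokens[i][0], tokens[j][1]
            if normalize(text[start:end]) in NORMALIZED:
                spans.append((start, end))
    return spans


def rule_matches(text):
    """Return character spans matched by the keyword rule."""
    return [m.span() for m in RULE.finditer(text)]


def merge_spans(spans):
    """Merge overlapping spans into their union."""
    merged = []
    for start, end in sorted(spans):
        if merged and start < merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def find_pathway_mentions(text):
    """Combine both components and return the mention strings."""
    spans = merge_spans(soft_dictionary_matches(text) + rule_matches(text))
    return [text[start:end] for start, end in spans]
```

In this sketch, "Wnt-signalling pathways" is found by both components and reported once after merging, while a name absent from the toy dictionary, such as "JAK-STAT pathway", is found by the rule alone.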
    Original language: English
    Journal: BMC Systems Biology
    Volume: 7
    Issue number: Suppl 3
    Publication status: Published - 16 Oct 2013
