Abstract
This paper describes a supervised approach to the recognition of biological events, which combines statistical sequential labelling and symbolic event extractionrules. Bottom-up textual induction has been applied to generating eventextraction rules. As an evaluation data set, we use a corpus of biomedical abstracts, in which biological events concerning gene regulation in E. coli and H. Sapiens have been annotated by a group of biologists. The event instance extraction performance has been evaluated using 10- fold cross validation. The experimental results show that named entity recognition (NER) and semantic role labelling (SRL) performance are close to annotator performance, as indicated by the interannotator agreement (IAA) scores, whereas automatic event extraction performance is around 28%, as compared to 40% IAA for exact manual event extraction.
Original language | English |
---|---|
Title of host publication | Proceedings of the 3rd International Symposium on Languages in Biology and Medicine (LBM-2009) |
Pages | 91-96 |
Number of pages | 6 |
Publication status | Published - 2009 |
Event | 3rd International Symposium on Languages in Biology and Medicine (LBM-2009) - Duration: 1 Jan 1824 → … |
Conference
Conference | 3rd International Symposium on Languages in Biology and Medicine (LBM-2009) |
---|---|
Period | 1/01/24 → … |
Keywords
- text mining
- biology
- event extraction
- predicate argument structure
- machine learning