Abstract
Named entity disambiguation concerns linking a potentially ambiguous mention of named entity in text to an unambiguous identifier in a standard database. One approach to this task is supervised classification. However, the availability of training data is often limited, and the available data sets tend to be imbalanced and, in some cases, heterogeneous. We propose a new method that distinguishes a named entity by finding the informative keywords in its surrounding context, and then trains a model to predict whether each keyword indicates the semantic class of the entity. While maintaining a comparable performance to supervised classification, this method avoids using expensive manually annotated data for each new domain, and thus achieves better portability. © 2009 ACL and AFNLP.
Original language | English |
---|---|
Title of host publication | EMNLP 2009 - Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: A Meeting of SIGDAT, a Special Interest Group of ACL, Held in Conjunction with ACL-IJCNLP 2009|EMNLP - Proc. Conf. Empir. Methods Nat. Lang. Process.: Meet. SIGDAT, Spec. Interest Group ACL, Held Conjunction ACL-IJCNLP |
Pages | 1513-1522 |
Number of pages | 9 |
Publication status | Published - 2009 |
Event | 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009 - Singapore Duration: 1 Jul 2009 → … |
Conference
Conference | 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009 |
---|---|
City | Singapore |
Period | 1/07/09 → … |