Abstract
Entity sense disambiguation becomes difficult with few or even zero training instances available, which is known as imbalanced learning problem in machine learning. To overcome the problem, we create a new set of reliable training instances from dictionary, called dictionarybased prototypes. A hierarchical classification system with a tree-like structure is designed to learn from both the prototypes and training instances, and three different types of classifiers are employed. In addition, supervised dimensionality reduction is conducted in a similarity-based space. Experimental results show our system outperforms three baseline systems by at least 8.3% as measured by macro F 1 score.
Original language | English |
---|---|
Title of host publication | Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference|Coling - Int. Conf. Comput. Linguist., Proc. Conf. |
Publisher | Association for Computational Linguistics |
Pages | 851-859 |
Number of pages | 8 |
Volume | 2 |
Publication status | Published - 2010 |
Event | 23rd International Conference on Computational Linguistics, Coling 2010 - Beijing Duration: 1 Jul 2010 → … |
Conference
Conference | 23rd International Conference on Computational Linguistics, Coling 2010 |
---|---|
City | Beijing |
Period | 1/07/10 → … |