Imbalanced classification using dictionary-based prototypes and hierarchical decision rules for entity sense disambiguation

Tingting Mu, Xinglong Wang, Jun'ichi Tsujii, Sophia Ananiadou

    Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

    Abstract

    Entity sense disambiguation becomes difficult with few or even zero training instances available, which is known as imbalanced learning problem in machine learning. To overcome the problem, we create a new set of reliable training instances from dictionary, called dictionarybased prototypes. A hierarchical classification system with a tree-like structure is designed to learn from both the prototypes and training instances, and three different types of classifiers are employed. In addition, supervised dimensionality reduction is conducted in a similarity-based space. Experimental results show our system outperforms three baseline systems by at least 8.3% as measured by macro F 1 score.
    Original languageEnglish
    Title of host publicationColing 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference|Coling - Int. Conf. Comput. Linguist., Proc. Conf.
    PublisherAssociation for Computational Linguistics
    Pages851-859
    Number of pages8
    Volume2
    Publication statusPublished - 2010
    Event23rd International Conference on Computational Linguistics, Coling 2010 - Beijing
    Duration: 1 Jul 2010 → …

    Conference

    Conference23rd International Conference on Computational Linguistics, Coling 2010
    CityBeijing
    Period1/07/10 → …

    Fingerprint

    Dive into the research topics of 'Imbalanced classification using dictionary-based prototypes and hierarchical decision rules for entity sense disambiguation'. Together they form a unique fingerprint.

    Cite this