Abstract
Sierra & McNaught (2000) proposed a clustering algorithm based on analogy. It takes as input pairs of definitions sharing the same term headword, in the same domain but from different sources, compares these definitions and identifies couples of words in the same extended semantic context. These couples are then grouped to yield semantically-related clusters. Here, we further evaluate the clustering algorithm, and develop an improved version whose results are also evaluated.
Original language | English |
---|---|
Title of host publication | Proceedings of the 1st international symposium on Information and communication technologies (ISICT '03) |
Place of Publication | Dublin |
Publisher | Trinity College Dublin |
Pages | 304-309 |
DOIs | |
Publication status | Published - 2003 |
Keywords
- clustering
- computational lexicography
- natural language processing
- computational linguistics
- terminology
- definitions