Abstract
We present an approach to classification of biomedical terms based on the information acquired automatically from the corpus of relevant literature. The learning phase consists of two stages: acquisition of terminologically relevant contextual patterns (CPs) and selection of classes that apply to terms used with these patterns. CPs represent a generalisation of similar term contexts in the form of regular expressions containing lexical, syntactic and terminological information. The most probable classes for the training terms co-occurring with the statistically relevant CP are learned by a genetic algorithm. Term classification is based on the learnt results. First, each term is associated with the most frequently co-occurring CP. Classes attached to such CP are initially suggested as the term's potential classes. Then, the term is finally mapped to the most similar suggested class. © Springer-Verlag Berlin Heidelberg 2004.
Original language | English |
---|---|
Title of host publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|Lect. Notes Comput. Sci. |
Publisher | Springer Nature |
Pages | 345-351 |
Number of pages | 6 |
Volume | 3177 |
Publication status | Published - 2004 |
Event | Intelligent Data Engineering and Automated Learning - IDEAL 2004, 5th International Conference, Exeter, UK, August 25-27, 2004, Proceedings - Duration: 1 Jan 1824 → … http://dblp.uni-trier.de/db/conf/ideal/ideal2004.html#SpasicNA04http://dblp.uni-trier.de/rec/bibtex/conf/ideal/SpasicNA04.xmlhttp://dblp.uni-trier.de/rec/bibtex/conf/ideal/SpasicNA04 |
Publication series
Name | Lecture Notes in Computer Science |
---|
Conference
Conference | Intelligent Data Engineering and Automated Learning - IDEAL 2004, 5th International Conference, Exeter, UK, August 25-27, 2004, Proceedings |
---|---|
Period | 1/01/24 → … |
Internet address |