Abstract
This paper investigates multi-topic aspects in automatic classification of clinical free text in comparison with general text. In this paper, we facilitate two different views on multi-topics: the Closed Topic Assumption (CTA) and the Open Topic Assumption (OTA). Experimental results show that the characteristics of multi-topic assignments in the Computational Medicine Centre (CMC) Medical NLP Challenge Data is strongly OTA-oriented but general text Reuters-21578 is characterised in the middle of the OTA and CTA spectrum. Copyright © 2009 Inderscience Enterprises Ltd.
Original language | English |
---|---|
Pages (from-to) | 299-313 |
Number of pages | 14 |
Journal | International Journal of Data Mining and Bioinformatics |
Volume | 3 |
Issue number | 3 |
DOIs | |
Publication status | Published - 2009 |