Abstract
Medical care data is a valuable resource that can be used for many purposes including managing and planning for future health needs as well as clinical research. However, the heterogeneity and complexity of medical data can be an obstacle in applying data mining techniques. Much of the potential value of this data therefore goes untapped. In this paper we have developed a methodology that reduces the dimensionality of primary care data, in order to make it more amenable to visualisation, mining and clustering. The methodology involves employing a combination of ontology-based semantic similarity and principal component analysis (PCA) to map the data into an appropriate and informative low dimensional space. Throughout the study, we had access to anonymised patient data from primary care in Salford, UK. The results of our application of this methodology show that diagnosis codes in primary care data can be used to map patients into an informative low dimensional space, which in turn provides the opportunity to support further data exploration and medical hypothesis formulation. © 2013 IMIA and IOS Press.
Original language | English |
---|---|
Title of host publication | Studies in Health Technology and Informatics|Stud. Health Technol. Informatics |
Pages | 52-56 |
Number of pages | 4 |
Volume | 192 |
DOIs | |
Publication status | Published - 2013 |
Event | 14th World Congress on Medical and Health Informatics, MEDINFO 2013 - Copenhagen Duration: 1 Jul 2013 → … |
Conference
Conference | 14th World Congress on Medical and Health Informatics, MEDINFO 2013 |
---|---|
City | Copenhagen |
Period | 1/07/13 → … |
Keywords
- Data Mining
- Electronic Health Records
- Primary Health Care
- Principal Component Analysis
- Semantics