Unsupervised annotation of phenotypic abnormalities via semantic latent representations on electronic health records

Jingqing Zhang, Kai Sun, Xian Yang, Chengliang Dai, Yike Guo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The extraction of phenotype information which is naturally contained in electronic health records (EHRs) has been found to be useful in various clinical informatics applications such as disease diagnosis. However, due to imprecise descriptions, lack of gold standards and the demand for efficiency, annotating phenotypic abnormalities on millions of EHR narratives is still challenging. In this work, we propose a novel unsupervised deep learning framework to annotate the phenotypic abnormalities from EHRs via semantic latent representations. The proposed framework takes the advantage of Human Phenotype Ontology (HPO), which is a knowledge base of phenotypic abnormalities, to standardize the annotation results. Experiments have been conducted on 52,722 EHRs from MIMIC-III dataset. Quantitative and qualitative analysis have shown the proposed framework achieves state-of-the-art annotation performance and computational efficiency compared with other methods.
Original languageEnglish
Title of host publication2019 IEEE International Conference on Bioinformatics and Biomedicine
Subtitle of host publicationNovember 18-21, 2019, San Diego, CA, USA
EditorsIllhoi Yoo, Jinbo Bi, Xiaohua Hu
Place of PublicationUSA
PublisherIEEE
Pages598-603
Number of pages6
ISBN (Electronic)9781728118673
ISBN (Print)9781728118680
DOIs
Publication statusPublished - 2019

Keywords

  • phenotype annotation
  • unsupervised learning
  • natural language processing
  • deep learning
  • electronic health records

Fingerprint

Dive into the research topics of 'Unsupervised annotation of phenotypic abnormalities via semantic latent representations on electronic health records'. Together they form a unique fingerprint.

Cite this