Quality assurance of the content of a large DL-based terminology using mixed lexical and semantic criteria: Experience with SNOMED CT

Alan Rector, Luigi Iannone, Robert Stevens

    Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

    Abstract

    SNOMED-CT is a large medical terminology based on description logic and mandated for use in the US, UK and several other countries. The hierarchies are known to contain many errors, but have so far proved difficult to analyse or quality assure. We present a series of methods and lessons learnt from experience in quality assuring a "module" of SNOMED for specific applications that we expect to generalize both to SNOMED as a whole and to other large ontologies. They feature a) dependence on domain exper-tise b) starting from classes selected for relevance to specific applications, c) tracing all errors to their root and verifying repairs by reclassification d) extraction of manageable-sized "modules"; e) mixed semantic and lexical criteria, and f) extensive use of scripting. They aim to reduce the cognitive load on experts by a) looking initially up-wards rather than downwards in the hierarchies, b) breaking up long lists of direct subclasses by introducing definitions for meaningful subcategories. Errors found range from simple mistakes to systematic errors in schemas. © 2011 ACM.
    Original languageEnglish
    Title of host publicationKCAP 2011 - Proceedings of the 2011 Knowledge Capture Conference|KCAP - Proc. Knowl. Capture Conf.
    PublisherAssociation for Computing Machinery
    Pages57-64
    Number of pages7
    ISBN (Print)9781450303965
    DOIs
    Publication statusPublished - 2011
    Event6th International Conference on Knowledge Capture, KCAP 2011 - Banff, AB
    Duration: 1 Jul 2011 → …

    Conference

    Conference6th International Conference on Knowledge Capture, KCAP 2011
    CityBanff, AB
    Period1/07/11 → …

    Keywords

    • description logics
    • modularization
    • ontologies
    • OWL
    • quality assurance

    Fingerprint

    Dive into the research topics of 'Quality assurance of the content of a large DL-based terminology using mixed lexical and semantic criteria: Experience with SNOMED CT'. Together they form a unique fingerprint.

    Cite this