Medizinische Universität Graz - Research portal

Selected Publication:

López-García, P; Boeker, M; Illarramendi, A; Schulz, S.
Usability-driven pruning of large ontologies: the case of SNOMED CT.
J Am Med Inform Assoc. 2012; 19(e1):e102-e109 Doi: 10.1136/amiajnl-2011-000503 [OPEN ACCESS]

 

Leading authors Med Uni Graz
Lopez Garcia Pablo
Co-authors Med Uni Graz
Schulz Stefan

Abstract:
Objectives: To study ontology modularization techniques when applied to SNOMED CT in a scenario in which no previous corpus of information exists and to examine if frequency-based filtering using MEDLINE can reduce subset size without discarding relevant concepts.
Materials and Methods: Subsets were first extracted using four graph-traversal heuristics and one logic-based technique, and were subsequently filtered with frequency information from MEDLINE. Twenty manually coded discharge summaries from cardiology patients were used as signatures and test sets. The coverage, size, and precision of extracted subsets were measured.
Results: Graph-traversal heuristics provided high coverage (71-96% of terms in the test sets of discharge summaries) at the expense of subset size (17-51% of the size of SNOMED CT). Pre-computed subsets and logic-based techniques extracted small subsets (1%), but coverage was limited (24-55%). Filtering reduced the size of large subsets to 10% while still providing 80% coverage.
Discussion: Extracting subsets to annotate discharge summaries is challenging when no previous corpus exists. Ontology modularization provides valuable techniques, but the resulting modules grow as signatures spread across subhierarchies, yielding a very low precision.
Conclusion: Graph-traversal strategies and frequency data from an authoritative source can prune large biomedical ontologies and produce useful subsets that still exhibit acceptable coverage. However, a clinical corpus closer to the specific use case is preferred when available.
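Illustrative sketch (not the authors' implementation): the abstract describes extracting a subset by graph traversal, filtering it with frequency data, and then measuring coverage, size, and precision. The Python below shows that pipeline on a toy is-a hierarchy. Everything in it is an assumption made for illustration: the upward-traversal heuristic stands in for the four heuristics, which the abstract does not specify; precision is taken to be the share of subset concepts that occur in the test set; and the concept names, frequency counts, and threshold are invented.

```python
from collections import deque


def upward_closure(parents, signature):
    """Return the signature concepts plus all their ancestors (breadth-first).

    Assumption: a simple upward traversal stands in for the graph-traversal
    heuristics, which are not specified in the abstract.
    """
    seen = set(signature)
    queue = deque(signature)
    while queue:
        concept = queue.popleft()
        for parent in parents.get(concept, ()):
            if parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return seen


def filter_by_frequency(subset, frequency, min_count):
    """Keep only concepts whose corpus frequency reaches min_count."""
    return {c for c in subset if frequency.get(c, 0) >= min_count}


def evaluate(subset, test_concepts, ontology_size):
    """Coverage, precision, and relative size of a subset (assumed definitions)."""
    hits = subset & test_concepts
    return {
        "coverage": len(hits) / len(test_concepts),
        "precision": len(hits) / len(subset) if subset else 0.0,
        "relative_size": len(subset) / ontology_size,
    }


# Toy is-a hierarchy (child -> parents); hypothetical concept names.
parents = {
    "myocardial_infarction": ["heart_disease"],
    "angina": ["heart_disease"],
    "heart_disease": ["disorder"],
    "fracture": ["disorder"],
    "disorder": [],
}
# Hypothetical frequency counts, standing in for MEDLINE frequency data.
frequency = {
    "myocardial_infarction": 900,
    "angina": 400,
    "heart_disease": 1200,
    "fracture": 700,
    "disorder": 3,
}
signature = {"myocardial_infarction"}
test_concepts = {"myocardial_infarction", "heart_disease", "angina"}

subset = upward_closure(parents, signature)
filtered = filter_by_frequency(subset, frequency, min_count=100)

before = evaluate(subset, test_concepts, len(parents))
after = evaluate(filtered, test_concepts, len(parents))
print(before)
print(after)
```

In this toy case the filter drops the rarely used top-level concept, so the subset shrinks while coverage stays the same. That is the kind of trade-off the abstract reports, though the numbers here carry no empirical meaning.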
Find related publications in this database (using NLM MeSH Indexing)
Cardiology - classification
Humans
MEDLINE
Patient Discharge
Systematized Nomenclature of Medicine
