Selected Publication:
SHR
Neuro
Cancer
Cardio
Lipid
Metab
Microb
López-García, P; Boeker, M; Illarramendi, A; Schulz, S.
Usability-driven pruning of large ontologies: the case of SNOMED CT.
J Am Med Inform Assoc. 2012; 19(e1):e102-e109
Doi: 10.1136/amiajnl-2011-000503
[OPEN ACCESS]
Web of Science
PubMed
FullText
FullText_MUG
- Leading authors Med Uni Graz
-
Lopez Garcia Pablo
- Co-authors Med Uni Graz
-
Schulz Stefan
- Altmetrics:
- Dimensions Citations:
- Plum Analytics:
- Scite (citation analytics):
- Abstract:
- Objectives To study ontology modularization techniques when applied to SNOMED CT in a scenario in which no previous corpus of information exists and to examine if frequency-based filtering using MEDLINE can reduce subset size without discarding relevant concepts. Materials and Methods Subsets were first extracted using four graph-traversal heuristics and one logic-based technique, and were subsequently filtered with frequency information from MEDLINE. Twenty manually coded discharge summaries from cardiology patients were used as signatures and test sets. The coverage, size, and precision of extracted subsets were measured. Results Graph-traversal heuristics provided high coverage (71-96% of terms in the test sets of discharge summaries) at the expense of subset size (17-51% of the size of SNOMED CT). Pre-computed subsets and logic-based techniques extracted small subsets (1%), but coverage was limited (24-55%). Filtering reduced the size of large subsets to 10% while still providing 80% coverage. Discussion Extracting subsets to annotate discharge summaries is challenging when no previous corpus exists. Ontology modularization provides valuable techniques, but the resulting modules grow as signatures spread across subhierarchies, yielding a very low precision. Conclusion Graph-traversal strategies and frequency data from an authoritative source can prune large biomedical ontologies and produce useful subsets that still exhibit acceptable coverage. However, a clinical corpus closer to the specific use case is preferred when available.
- Find related publications in this database (using NLM MeSH Indexing)
-
Cardiology - classification
-
Humans -
-
MEDLINE -
-
Patient Discharge -
-
Systematized Nomenclature of Medicine -