Medizinische Universität Graz Austria/Österreich - Forschungsportal - Medical University of Graz

Logo MUG-Forschungsportal

Gewählte Publikation:

SHR Neuro Krebs Kardio Lipid Stoffw Microb

Pfeifer, B; Sirocchi, C; Bloice, MD; Kreuzthaler, M; Urschler, M.
Federated unsupervised random forest for privacy-preserving patient stratification.
Bioinformatics. 2024; 40(Supplement_2): ii198-ii207. Doi: 10.1093/bioinformatics/btae382 [OPEN ACCESS]
Web of Science PubMed PUBMED Central FullText FullText_MUG

 

Führende Autor*innen der Med Uni Graz
Pfeifer Bastian
Co-Autor*innen der Med Uni Graz
Bloice Marcus
Kreuzthaler Markus Eduard
Urschler Martin
Altmetrics:

Dimensions Citations:

Plum Analytics:

Scite (citation analytics):

Abstract:
MOTIVATION: In the realm of precision medicine, effective patient stratification and disease subtyping demand innovative methodologies tailored for multi-omics data. Clustering techniques applied to multi-omics data have become instrumental in identifying distinct subgroups of patients, enabling a finer-grained understanding of disease variability. Meanwhile, clinical datasets are often small and must be aggregated from multiple hospitals. Online data sharing, however, is seen as a significant challenge due to privacy concerns, potentially impeding big data's role in medical advancements using machine learning. This work establishes a powerful framework for advancing precision medicine through unsupervised random forest-based clustering in combination with federated computing. RESULTS: We introduce a novel multi-omics clustering approach utilizing unsupervised random forests. The unsupervised nature of the random forest enables the determination of cluster-specific feature importance, unraveling key molecular contributors to distinct patient groups. Our methodology is designed for federated execution, a crucial aspect in the medical domain where privacy concerns are paramount. We have validated our approach on machine learning benchmark datasets as well as on cancer data from The Cancer Genome Atlas. Our method is competitive with the state-of-the-art in terms of disease subtyping, but at the same time substantially improves the cluster interpretability. Experiments indicate that local clustering performance can be improved through federated computing. AVAILABILITY AND IMPLEMENTATION: The proposed methods are available as an R-package (https://github.com/pievos101/uRF).
Find related publications in this database (using NLM MeSH Indexing)
Humans - administration & dosage
Cluster Analysis - administration & dosage
Precision Medicine - methods
Unsupervised Machine Learning - administration & dosage
Machine Learning - administration & dosage
Neoplasms - administration & dosage
Privacy - administration & dosage
Algorithms - administration & dosage
Random Forest - administration & dosage

© Med Uni Graz Impressum