Selected Publication:
SHR
Neuro
Cancer
Cardio
Lipid
Metab
Microb
Pfeifer, B; Sirocchi, C; Bloice, MD; Kreuzthaler, M; Urschler, M.
Federated unsupervised random forest for privacy-preserving patient stratification.
Bioinformatics. 2024; 40(Supplement_2): ii198-ii207.
Doi: 10.1093/bioinformatics/btae382
[OPEN ACCESS]
Web of Science
PubMed
FullText
FullText_MUG
- Leading authors Med Uni Graz
-
Pfeifer Bastian
- Co-authors Med Uni Graz
-
Bloice Marcus
-
Kreuzthaler Markus Eduard
-
Urschler Martin
- Altmetrics:
- Dimensions Citations:
- Plum Analytics:
- Scite (citation analytics):
- Abstract:
- MOTIVATION: In the realm of precision medicine, effective patient stratification and disease subtyping demand innovative methodologies tailored for multi-omics data. Clustering techniques applied to multi-omics data have become instrumental in identifying distinct subgroups of patients, enabling a finer-grained understanding of disease variability. Meanwhile, clinical datasets are often small and must be aggregated from multiple hospitals. Online data sharing, however, is seen as a significant challenge due to privacy concerns, potentially impeding big data's role in medical advancements using machine learning. This work establishes a powerful framework for advancing precision medicine through unsupervised random forest-based clustering in combination with federated computing. RESULTS: We introduce a novel multi-omics clustering approach utilizing unsupervised random forests. The unsupervised nature of the random forest enables the determination of cluster-specific feature importance, unraveling key molecular contributors to distinct patient groups. Our methodology is designed for federated execution, a crucial aspect in the medical domain where privacy concerns are paramount. We have validated our approach on machine learning benchmark datasets as well as on cancer data from The Cancer Genome Atlas. Our method is competitive with the state-of-the-art in terms of disease subtyping, but at the same time substantially improves the cluster interpretability. Experiments indicate that local clustering performance can be improved through federated computing. AVAILABILITY AND IMPLEMENTATION: The proposed methods are available as an R-package (https://github.com/pievos101/uRF).
- Find related publications in this database (using NLM MeSH Indexing)
-
Humans - administration & dosage
-
Cluster Analysis - administration & dosage
-
Precision Medicine - methods
-
Unsupervised Machine Learning - administration & dosage
-
Machine Learning - administration & dosage
-
Neoplasms - administration & dosage
-
Privacy - administration & dosage
-
Algorithms - administration & dosage
-
Random Forest - administration & dosage