Selected Publication:
SHR
Neuro
Cancer
Cardio
Lipid
Metab
Microb
Pfeifer, B; Gevaert, A; Loecher, M; Holzinger, A.
Tree smoothing: Post-hoc regularization of tree ensembles for interpretable machine learning
INFORM SCIENCES. 2025; 690: 121564
Doi: 10.1016/j.ins.2024.121564
Web of Science
FullText
FullText_MUG
- Leading authors Med Uni Graz
-
Holzinger Andreas
-
Pfeifer Bastian
- Altmetrics:
- Dimensions Citations:
- Plum Analytics:
- Scite (citation analytics):
- Abstract:
- Random Forests (RFs) are powerful ensemble learning algorithms that are widely used in various machine learning tasks. However, they tend to overfit noisy or irrelevant features, which can result in decreased generalization performance. Post-hoc regularization techniques aim to solve this problem by modifying the structure of the learned ensemble after training. We propose a novel post-hoc regularization via tree smoothing for classification tasks to leverage the reliable class distributions closer to the root node whilst reducing the impact of more specific and potentially noisy splits deeper in the tree. Our novel approach allows for a form of pruning that does not alter the general structure of the trees, adjusting the influence of nodes based on their proximity to the root node. We evaluated the performance of our method on various machine learning benchmark data sets and on cancer data from The Cancer Genome Atlas (TCGA). Our approach demonstrates competitive performance compared to the state-of-the-art and, in the majority of cases, and outperforms it in most cases in terms of prediction accuracy, generalization, and interpretability.
- Find related publications in this database (Keywords)
-
Explainable artificial intelligence
-
XAI
-
Random forests
-
Regularization
-
Feature importance