Selected Publication:
SHR
Neuro
Cancer
Cardio
Lipid
Metab
Microb
Koloi, A; Loukas, VS; Hourican, C; Sakellarios, A; Quax, R; Mishra, PP; Lehtimäki, T; Raitakari, OT; Papaloukas, C; Bosch, JA; März, W; Fotiadis, D.
Predicting early-stage coronary artery disease using machine learning and routine clinical biomarkers improved by augmented virtual data
EUR HEART J-DIGIT HL. 2024; 5(5): 542-550.
Doi: 10.1093/ehjdh/ztae049
[OPEN ACCESS]
Web of Science
PubMed
FullText
FullText_MUG
- Co-authors Med Uni Graz
-
März Winfried
- Altmetrics:
- Dimensions Citations:
- Plum Analytics:
- Scite (citation analytics):
- Abstract:
- AIMS: Coronary artery disease (CAD) is a highly prevalent disease with modifiable risk factors. In patients with suspected obstructive CAD, evaluating the pre-test probability model is crucial for diagnosis, although its accuracy remains controversial. Machine learning (ML) predictive models can help clinicians detect CAD early and improve outcomes. This study aimed to identify early-stage CAD using ML in conjunction with a panel of clinical and laboratory tests. METHODS AND RESULTS: The study sample included 3316 patients enrolled in the Ludwigshafen Risk and Cardiovascular Health (LURIC) study. A comprehensive array of attributes was considered, and an ML pipeline was developed. Subsequently, we utilized five approaches to generating high-quality virtual patient data to improve the performance of the artificial intelligence models. An extension study was carried out using data from the Young Finns Study (YFS) to assess the results' generalizability. Upon applying virtual augmented data, accuracy increased by approximately 5%, from 0.75 to -0.79 for random forests (RFs), and from 0.76 to -0.80 for Gradient Boosting (GB). Sensitivity showed a significant boost for RFs, rising by about 9.4% (0.81-0.89), while GB exhibited a 4.8% increase (0.83-0.87). Specificity showed a significant boost for RFs, rising by ∼24% (from 0.55 to 0.70), while GB exhibited a 37% increase (from 0.51 to 0.74). The extension analysis aligned with the initial study. CONCLUSION: Accurate predictions of angiographic CAD can be obtained using a set of routine laboratory markers, age, sex, and smoking status, holding the potential to limit the need for invasive diagnostic techniques. The extension analysis in the YFS demonstrated the potential of these findings in a younger population, and it confirmed applicability to atherosclerotic vascular disease.
- Find related publications in this database (Keywords)
-
Coronary artery disease
-
Machine learning
-
Classification algorithms
-
Data Augmentation