Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission
- PMID: 36437469
- PMCID: PMC9702742
- DOI: 10.1186/s12911-022-02057-4
Comparison of machine learning methods with logistic regression analysis in creating predictive models for risk of critical in-hospital events in COVID-19 patients on hospital admission
Abstract
Background: Machine learning (ML) algorithms have been trained to early predict critical in-hospital events from COVID-19 using patient data at admission, but little is known on how their performance compares with each other and/or with statistical logistic regression (LR). This prospective multicentre cohort study compares the performance of a LR and five ML models on the contribution of influencing predictors and predictor-to-event relationships on prediction model´s performance.
Methods: We used 25 baseline variables of 490 COVID-19 patients admitted to 8 hospitals in Germany (March-November 2020) to develop and validate (75/25 random-split) 3 linear (L1 and L2 penalty, elastic net [EN]) and 2 non-linear (support vector machine [SVM] with radial kernel, random forest [RF]) ML approaches for predicting critical events defined by intensive care unit transfer, invasive ventilation and/or death (composite end-point: 181 patients). Models were compared for performance (area-under-the-receiver-operating characteristic-curve [AUC], Brier score) and predictor importance (performance-loss metrics, partial-dependence profiles).
Results: Models performed close with a small benefit for LR (utilizing restricted cubic splines for non-linearity) and RF (AUC means: 0.763-0.731 [RF-L1]); Brier scores: 0.184-0.197 [LR-L1]). Top ranked predictor variables (consistently highest importance: C-reactive protein) were largely identical across models, except creatinine, which exhibited marginal (L1, L2, EN, SVM) or high/non-linear effects (LR, RF) on events.
Conclusions: Although the LR and ML models analysed showed no strong differences in performance and the most influencing predictors for COVID-19-related event prediction, our results indicate a predictive benefit from taking account for non-linear predictor-to-event relationships and effects. Future efforts should focus on leveraging data-driven ML technologies from static towards dynamic modelling solutions that continuously learn and adapt to changes in data environments during the evolving pandemic.
Trial registration number: NCT04659187.
Keywords: COVID-19; Clinical decision-making; Critical event prediction; Machine learning; Predictive models.
© 2022. The Author(s).
Conflict of interest statement
The authors declare that they have no competing interests.
Figures
Similar articles
-
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0. BMC Public Health. 2024. PMID: 38943093 Free PMC article.
-
Development and Validation of Unplanned Extubation Prediction Models Using Intensive Care Unit Data: Retrospective, Comparative, Machine Learning Study.J Med Internet Res. 2021 Aug 11;23(8):e23508. doi: 10.2196/23508. J Med Internet Res. 2021. PMID: 34382940 Free PMC article.
-
Can machine learning predict pharmacotherapy outcomes? An application study in osteoporosis.Comput Methods Programs Biomed. 2022 Oct;225:107028. doi: 10.1016/j.cmpb.2022.107028. Epub 2022 Jul 21. Comput Methods Programs Biomed. 2022. PMID: 35930862
-
Regression and Random Forest Machine Learning Have Limited Performance in Predicting Bowel Preparation in Veteran Population.Dig Dis Sci. 2022 Jul;67(7):2827-2841. doi: 10.1007/s10620-021-07113-z. Epub 2021 Jun 24. Dig Dis Sci. 2022. PMID: 34169434 Review.
-
Predicting renal function recovery and short-term reversibility among acute kidney injury patients in the ICU: comparison of machine learning methods and conventional regression.Ren Fail. 2022 Dec;44(1):1326-1337. doi: 10.1080/0886022X.2022.2107542. Ren Fail. 2022. PMID: 35930309 Free PMC article. Review.
Cited by
-
Life-Course Multidisciplinary Psychosocial Predictors of Dementia Among Older Adults: Results From the Health and Retirement Study.Innov Aging. 2024 Oct 18;8(11):igae092. doi: 10.1093/geroni/igae092. eCollection 2024. Innov Aging. 2024. PMID: 39544491 Free PMC article.
-
Artificial Intelligence Models in Health Information Exchange: A Systematic Review of Clinical Implications.Healthcare (Basel). 2023 Sep 19;11(18):2584. doi: 10.3390/healthcare11182584. Healthcare (Basel). 2023. PMID: 37761781 Free PMC article. Review.
-
Higher in-hospital mortality in SARS-CoV-2 omicron variant infection compared to influenza infection-Insights from the CORONA Germany study.PLoS One. 2023 Sep 27;18(9):e0292017. doi: 10.1371/journal.pone.0292017. eCollection 2023. PLoS One. 2023. PMID: 37756299 Free PMC article.
-
Supervised Machine Learning Models to Identify Early-Stage Symptoms of SARS-CoV-2.Sensors (Basel). 2022 Dec 21;23(1):40. doi: 10.3390/s23010040. Sensors (Basel). 2022. PMID: 36616638 Free PMC article.
References
-
- Tsui ELH, Lui CSM, Woo PPS, Cheung ATL, Lam PKW, Tang VTW, et al. Development of a data-driven COVID-19 prognostication tool to inform triage and step-down care for hospitatlised patients in Hong Kong: a population-based cohort study. BMC Med Inform Decis Mak. 2020;20(1):323. doi: 10.1186/s12911-020-01338-0. - DOI - PMC - PubMed
MeSH terms
Associated data
LinkOut - more resources
Full Text Sources
Medical
Research Materials