Multivariable prediction models for health care spending using machine learning: a protocol of a systematic review

Andrew W. Huang; Martin Haslberger; Neto Coulibaly; Omar Galárraga; Arman Oganisian; Lazaros Belbasis; Orestis A. Panagiotou

doi:10.1186/s41512-022-00119-9

Diagnostic and Prognostic Research (Mar 2022)

Multivariable prediction models for health care spending using machine learning: a protocol of a systematic review

Andrew W. Huang,
Martin Haslberger,
Neto Coulibaly,
Omar Galárraga,
Arman Oganisian,
Lazaros Belbasis,
Orestis A. Panagiotou

Affiliations

Andrew W. Huang: Department of Health Services, Policy and Practice, Brown University School of Public Health
Martin Haslberger: QUEST Center, Berlin Institute of Health, Charité–Universitätsmedizin Berlin
Neto Coulibaly: Department of Health Services, Policy and Practice, Brown University School of Public Health
Omar Galárraga: Department of Health Services, Policy and Practice, Brown University School of Public Health
Arman Oganisian: Department of Biostatistics, Brown University School of Public Health
Lazaros Belbasis: Meta-Research Innovation Center Berlin, QUEST Center, Berlin Institute of Health, Charité–Universitätsmedizin Berlin
Orestis A. Panagiotou: Department of Health Services, Policy and Practice, Brown University School of Public Health

DOI: https://doi.org/10.1186/s41512-022-00119-9
Journal volume & issue: Vol. 6, no. 1
pp. 1 – 5

Abstract

Read online

Abstract Background With rising cost pressures on health care systems, machine-learning (ML)-based algorithms are increasingly used to predict health care costs. Despite their potential advantages, the successful implementation of these methods could be undermined by biases introduced in the design, conduct, or analysis of studies seeking to develop and/or validate ML models. The utility of such models may also be negatively affected by poor reporting of these studies. In this systematic review, we aim to evaluate the reporting quality, methodological characteristics, and risk of bias of ML-based prediction models for individual-level health care spending. Methods We will systematically search PubMed and Embase to identify studies developing, updating, or validating ML-based models to predict an individual’s health care spending for any medical condition, over any time period, and in any setting. We will exclude prediction models of aggregate-level health care spending, models used to infer causality, models using radiomics or speech parameters, models of non-clinically validated predictors (e.g., genomics), and cost-effectiveness analyses without predicting individual-level health care spending. We will extract data based on the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modeling Studies (CHARMS), previously published research, and relevant recommendations. We will assess the adherence of ML-based studies to the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement and examine the inclusion of transparency and reproducibility indicators (e.g. statements on data sharing). To assess the risk of bias, we will apply the Prediction model Risk Of Bias Assessment Tool (PROBAST). Findings will be stratified by study design, ML methods used, population characteristics, and medical field. Discussion Our systematic review will appraise the quality, reporting, and risk of bias of ML-based models for individualized health care cost prediction. This review will provide an overview of the available models and give insights into the strengths and limitations of using ML methods for the prediction of health spending.

Published in Diagnostic and Prognostic Research

ISSN: 2397-7523 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General)
Website: https://diagnprognres.biomedcentral.com/

About the journal

Abstract

Keywords