BMC Medicine (Dec 2019)

Calibration: the Achilles heel of predictive analytics

  • Ben Van Calster,
  • David J. McLernon,
  • Maarten van Smeden,
  • Laure Wynants,
  • Ewout W. Steyerberg,
  • On behalf of Topic Group ‘Evaluating diagnostic tests and prediction models’ of the STRATOS initiative

DOI
https://doi.org/10.1186/s12916-019-1466-7
Journal volume & issue
Vol. 17, no. 1
pp. 1 – 7

Abstract

Read online

Abstract Background The assessment of calibration performance of risk prediction models based on regression or more flexible machine learning algorithms receives little attention. Main text Herein, we argue that this needs to change immediately because poorly calibrated algorithms can be misleading and potentially harmful for clinical decision-making. We summarize how to avoid poor calibration at algorithm development and how to assess calibration at algorithm validation, emphasizing balance between model complexity and the available sample size. At external validation, calibration curves require sufficiently large samples. Algorithm updating should be considered for appropriate support of clinical practice. Conclusion Efforts are required to avoid poor calibration when developing prediction models, to evaluate calibration when validating models, and to update models when indicated. The ultimate aim is to optimize the utility of predictive analytics for shared decision-making and patient counseling.

Keywords