EBioMedicine (Apr 2023)

Development and validation of echocardiography-based machine-learning models to predict mortalityResearch in context

  • Akshay Valsaraj,
  • Sunil Vasu Kalmady,
  • Vaibhav Sharma,
  • Matthew Frost,
  • Weijie Sun,
  • Nariman Sepehrvand,
  • Marcus Ong,
  • Cyril Equibec,
  • Jason R.B. Dyck,
  • Todd Anderson,
  • Harald Becher,
  • Sarah Weeks,
  • Jasper Tromp,
  • Chung-Lieh Hung,
  • Justin A. Ezekowitz,
  • Padma Kaul

Journal volume & issue
Vol. 90
p. 104479

Abstract

Read online

Summary: Background: Echocardiography (echo) based machine learning (ML) models may be useful in identifying patients at high-risk of all-cause mortality. Methods: We developed ML models (ResNet deep learning using echo videos and CatBoost gradient boosting using echo measurements) to predict 1-year, 3-year, and 5-year mortality. Models were trained on the Mackay dataset, Taiwan (6083 echos, 3626 patients) and validated in the Alberta HEART dataset, Canada (997 echos, 595 patients). We examined the performance of the models overall, and in subgroups (healthy controls, at risk of heart failure (HF), HF with reduced ejection fraction (HFrEF) and HF with preserved ejection fraction (HFpEF)). We compared the models' performance to the MAGGIC risk score, and examined the correlation between the models’ predicted probability of death and baseline quality of life as measured by the Kansas City Cardiomyopathy Questionnaire (KCCQ). Findings: Mortality rates at 1-, 3- and 5-years were 14.9%, 28.6%, and 42.5% in the Mackay cohort, and 3.0%, 10.3%, and 18.7%, in the Alberta HEART cohort. The ResNet and CatBoost models achieved area under the receiver-operating curve (AUROC) between 85% and 92% in internal validation. In external validation, the AUROCs for the ResNet (82%, 82%, and 78%) were significantly better than CatBoost (78%, 73%, and 75%), for 1-, 3- and 5-year mortality prediction respectively, with better or comparable performance to the MAGGIC score. ResNet models predicted higher probability of death in the HFpEF and HFrEF (30%–50%) subgroups than in controls and at risk patients (5%–20%). The predicted probabilities of death correlated with KCCQ scores (all p < 0.05). Interpretation: Echo-based ML models to predict mortality had good internal and external validity, were generalizable, correlated with patients’ quality of life, and are comparable to an established HF risk score. These models can be leveraged for automated risk stratification at point-of-care. Funding: Funding for Alberta HEART was provided by an Alberta Innovates - Health Solutions Interdisciplinary Team Grant no. AHFMR ITG 200801018. P.K. holds a Canadian Institutes of Health Research (CIHR) Sex and Gender Science Chair and a Heart & Stroke Foundation Chair in Cardiovascular Research. A.V. and V.S. received funding from the Mitacs Globalink Research Internship.

Keywords