Cardio-Oncology (Oct 2024)

Building a machine learning-assisted echocardiography prediction tool for children at risk for cancer therapy-related cardiomyopathy

  • Lindsay A. Edwards,
  • Christina Yang,
  • Surbhi Sharma,
  • Zih-Hua Chen,
  • Lahari Gorantla,
  • Sanika A. Joshi,
  • Nicolas J. Longhi,
  • Nahom Worku,
  • Jamie S. Yang,
  • Brandy Martinez Di Pietro,
  • Saro Armenian,
  • Aarti Bhat,
  • William Border,
  • Sujatha Buddhe,
  • Nancy Blythe,
  • Kayla Stratton,
  • Kasey J. Leger,
  • Wendy M. Leisenring,
  • Lillian R. Meacham,
  • Paul C. Nathan,
  • Shanti Narasimhan,
  • Ritu Sachdeva,
  • Karim Sadak,
  • Eric J. Chow,
  • Patrick M. Boyle

DOI
https://doi.org/10.1186/s40959-024-00268-4
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background Despite routine echocardiographic surveillance for childhood cancer survivors, the ability to predict cardiomyopathy risk in individual patients is limited. We explored the feasibility and optimal processes for machine learning-enhanced cardiomyopathy prediction in survivors using serial echocardiograms from five centers. Methods We designed a series of deep convolutional neural networks (DCNNs) for prediction of cardiomyopathy (shortening fraction ≤ 28% or ejection fraction ≤ 50% on two occasions) for at-risk survivors ≥ 1-year post initial cancer therapy. We built DCNNs with four subsets of echocardiographic data differing in timing relative to case (survivor who developed cardiomyopathy) index diagnosis and two input formats (montages) with differing image selections. We used holdout subsets in a 10-fold cross-validation framework and standard metrics to assess model performance (e.g., F1-score, area under the precision-recall curve [AUPRC]). Performance of the input formats was compared using a combined 5 × 2 cross-validation F-test. Results The dataset included 542 pairs of montages: 171 montage pairs from 45 cases at time of cardiomyopathy diagnosis or pre-diagnosis and 371 pairs from 70 at-risk survivors who didn’t develop cardiomyopathy during follow-up (non-case). The DCNN trained to distinguish between non-case and time of cardiomyopathy diagnosis or pre-diagnosis case montages achieved an AUROC of 0.89 ± 0.02, AUPRC 0.83 ± 0.03, and F1-score: 0.76 ± 0.04. When limited to smaller subsets of case data (e.g., ≥ 1 or 2 years pre-diagnosis), performance worsened. Model input format did not impact performance accuracy across models. Conclusions This methodology is a promising first step toward development of a DCNN capable of accurately differentiating pre-diagnosis versus non-case echocardiograms to predict survivors more likely to develop cardiomyopathy. Graphical Abstract

Keywords