Scientific Reports (Feb 2024)

Integrating imaging and genomic data for the discovery of distinct glioblastoma subtypes: a joint learning approach

  • Jun Guo,
  • Anahita Fathi Kazerooni,
  • Erik Toorens,
  • Hamed Akbari,
  • Fanyang Yu,
  • Chiharu Sako,
  • Elizabeth Mamourian,
  • Russell T. Shinohara,
  • Constantinos Koumenis,
  • Stephen J. Bagley,
  • Jennifer J. D. Morrissette,
  • Zev A. Binder,
  • Steven Brem,
  • Suyash Mohan,
  • Robert A. Lustig,
  • Donald M. O’Rourke,
  • Tapan Ganguly,
  • Spyridon Bakas,
  • MacLean P. Nasrallah,
  • Christos Davatzikos

DOI
https://doi.org/10.1038/s41598-024-55072-y
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Glioblastoma is a highly heterogeneous disease, with variations observed at both phenotypical and molecular levels. Personalized therapies would be facilitated by non-invasive in vivo approaches for characterizing this heterogeneity. In this study, we developed unsupervised joint machine learning between radiomic and genomic data, thereby identifying distinct glioblastoma subtypes. A retrospective cohort of 571 IDH-wildtype glioblastoma patients were included in the study, and pre-operative multi-parametric MRI scans and targeted next-generation sequencing (NGS) data were collected. L21-norm minimization was used to select a subset of 12 radiomic features from the MRI scans, and 13 key driver genes from the five main signal pathways most affected in glioblastoma were selected from the genomic data. Subtypes were identified using a joint learning approach called Anchor-based Partial Multi-modal Clustering on both radiomic and genomic modalities. Kaplan–Meier analysis identified three distinct glioblastoma subtypes: high-risk, medium-risk, and low-risk, based on overall survival outcome (p < 0.05, log-rank test; Hazard Ratio = 1.64, 95% CI 1.17–2.31, Cox proportional hazard model on high-risk and low-risk subtypes). The three subtypes displayed different phenotypical and molecular characteristics in terms of imaging histogram, co-occurrence of genes, and correlation between the two modalities. Our findings demonstrate the synergistic value of integrated radiomic signatures and molecular characteristics for glioblastoma subtyping. Joint learning on both modalities can aid in better understanding the molecular basis of phenotypical signatures of glioblastoma, and provide insights into the biological underpinnings of tumor formation and progression.