Nature Communications (Nov 2024)

Digital profiling of gene expression from histology images with linearized attention

  • Marija Pizurica,
  • Yuanning Zheng,
  • Francisco Carrillo-Perez,
  • Humaira Noor,
  • Wei Yao,
  • Christian Wohlfart,
  • Antoaneta Vladimirova,
  • Kathleen Marchal,
  • Olivier Gevaert

DOI
https://doi.org/10.1038/s41467-024-54182-5
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Cancer is a heterogeneous disease requiring costly genetic profiling for better understanding and management. Recent advances in deep learning have enabled cost-effective predictions of genetic alterations from whole slide images (WSIs). While transformers have driven significant progress in non-medical domains, their application to WSIs lags behind due to high model complexity and limited dataset sizes. Here, we introduce S E Q U O I A, a linearized transformer model that predicts cancer transcriptomic profiles from WSIs. S E Q U O I A is developed using 7584 tumor samples across 16 cancer types, with its generalization capacity validated on two independent cohorts comprising 1368 tumors. Accurately predicted genes are associated with key cancer processes, including inflammatory response, cell cycles and metabolism. Further, we demonstrate the value of S E Q U O I A in stratifying the risk of breast cancer recurrence and in resolving spatial gene expression at loco-regional levels. S E Q U O I A hence deciphers clinically relevant information from WSIs, opening avenues for personalized cancer management.