Inferring spatial transcriptomics markers from whole slide images to characterize metastasis-related spatial heterogeneity of colorectal tumors: A pilot study

Michael Fatemi; Eric Feng; Cyril Sharma; Zarif Azher; Tarushii Goel; Ojas Ramwala; Scott M. Palisoul; Rachael E. Barney; Laurent Perreard; Fred W. Kolling; Lucas A. Salas; Brock C. Christensen; Gregory J. Tsongalis; Louis J. Vaickus; Joshua J. Levy

Journal of Pathology Informatics (Jan 2023)

Inferring spatial transcriptomics markers from whole slide images to characterize metastasis-related spatial heterogeneity of colorectal tumors: A pilot study

Michael Fatemi,
Eric Feng,
Cyril Sharma,
Zarif Azher,
Tarushii Goel,
Ojas Ramwala,
Scott M. Palisoul,
Rachael E. Barney,
Laurent Perreard,
Fred W. Kolling,
Lucas A. Salas,
Brock C. Christensen,
Gregory J. Tsongalis,
Louis J. Vaickus,
Joshua J. Levy

Affiliations

Michael Fatemi: Department of Computer Science, University of Virginia, Charlottesville, VA, USA
Eric Feng: Thomas Jefferson High School for Science and Technology, Alexandria, VA, USA
Cyril Sharma: Department of Computer Science, Purdue University, West Lafayette, IN, USA
Zarif Azher: Thomas Jefferson High School for Science and Technology, Alexandria, VA, USA
Tarushii Goel: Department of Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
Ojas Ramwala: Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
Scott M. Palisoul: Emerging Diagnostic and Investigative Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Health, Lebanon, NH, USA
Rachael E. Barney: Emerging Diagnostic and Investigative Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Health, Lebanon, NH, USA
Laurent Perreard: Dartmouth Cancer Center, Lebanon, NH, USA
Fred W. Kolling: Dartmouth Cancer Center, Lebanon, NH, USA
Lucas A. Salas: Department of Epidemiology, Dartmouth College Geisel School of Medicine, Hanover, NH, USA; Department of Molecular and Systems Biology, Dartmouth College Geisel School of Medicine, Hanover, NH, USA; Integrative Neuroscience at Dartmouth (IND) graduate program, Dartmouth College Geisel School of Medicine, Hanover, NH, USA
Brock C. Christensen: Department of Epidemiology, Dartmouth College Geisel School of Medicine, Hanover, NH, USA; Department of Molecular and Systems Biology, Dartmouth College Geisel School of Medicine, Hanover, NH, USA; Department of Community and Family Medicine, Dartmouth College Geisel School of Medicine, Hanover, NH, USA
Gregory J. Tsongalis: Emerging Diagnostic and Investigative Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Health, Lebanon, NH, USA
Louis J. Vaickus: Emerging Diagnostic and Investigative Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Health, Lebanon, NH, USA
Joshua J. Levy: Emerging Diagnostic and Investigative Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Health, Lebanon, NH, USA; Department of Epidemiology, Dartmouth College Geisel School of Medicine, Hanover, NH, USA; Department of Dermatology, Dartmouth Health, Lebanon, NH, USA; Program in Quantitative Biomedical Sciences, Dartmouth College Geisel School of Medicine, Hanover, NH, USA; Corresponding author at: Department of Pathology and Laboratory Medicine, Dartmouth-Hitchcock Medical Center, 1 Medical Center Drive, Lebanon, NH, USA.

Journal volume & issue: Vol. 14
p. 100308

Abstract

Read online

Over 150 000 Americans are diagnosed with colorectal cancer (CRC) every year, and annually over 50 000 individuals will die from CRC, necessitating improvements in screening, prognostication, disease management, and therapeutic options. Tumor metastasis is the primary factor related to the risk of recurrence and mortality. Yet, screening for nodal and distant metastasis is costly, and invasive and incomplete resection may hamper adequate assessment. Signatures of the tumor-immune microenvironment (TIME) at the primary site can provide valuable insights into the aggressiveness of the tumor and the effectiveness of various treatment options. Spatially resolved transcriptomics technologies offer an unprecedented characterization of TIME through high multiplexing, yet their scope is constrained by cost. Meanwhile, it has long been suspected that histological, cytological, and macroarchitectural tissue characteristics correlate well with molecular information (e.g., gene expression). Thus, a method for predicting transcriptomics data through inference of RNA patterns from whole slide images (WSI) is a key step in studying metastasis at scale. In this work, we collected tissue from 4 stage-III (pT3) matched colorectal cancer patients for spatial transcriptomics profiling. The Visium spatial transcriptomics (ST) assay was used to measure transcript abundance for 17 943 genes at up to 5000 55-micron (i.e., 1–10 cells) spots per patient sampled in a honeycomb pattern, co-registered with hematoxylin and eosin (H&E) stained WSI. The Visium ST assay can measure expression at these spots through tissue permeabilization of mRNAs, which are captured through spatially (i.e., x–y positional coordinates) barcoded, gene specific oligo probes. WSI subimages were extracted around each co-registered Visium spot and were used to predict the expression at these spots using machine learning models. We prototyped and compared several convolutional, transformer, and graph convolutional neural networks to predict spatial RNA patterns at the Visium spots under the hypothesis that the transformer- and graph-based approaches better capture relevant spatial tissue architecture. We further analyzed the model’s ability to recapitulate spatial autocorrelation statistics using SPARK and SpatialDE. Overall, the results indicate that the transformer- and graph-based approaches were unable to outperform the convolutional neural network architecture, though they exhibited optimal performance for relevant disease-associated genes. Initial findings suggest that different neural networks that operate on different scales are relevant for capturing distinct disease pathways (e.g., epithelial to mesenchymal transition). We add further evidence that deep learning models can accurately predict gene expression in whole slide images and comment on understudied factors which may increase its external applicability (e.g., tissue context). Our preliminary work will motivate further investigation of inference for molecular patterns from whole slide images as metastasis predictors and in other applications.

Published in Journal of Pathology Informatics

ISSN: 2229-5089 (Print); 2153-3539 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Medicine: Pathology
Website: https://www.journals.elsevier.com/journal-of-pathology-informatics

About the journal

Abstract

Keywords