Multiple Instance Bagging and Risk Histogram for Survival Time Analysis Based on Whole Slide Images of Brain Cancer Patients

Yu Ping Chang; Ya-Chun Yang; Sung-Nien Yu

doi:10.3390/info15120750

Information (Nov 2024)

Multiple Instance Bagging and Risk Histogram for Survival Time Analysis Based on Whole Slide Images of Brain Cancer Patients

Yu Ping Chang,
Ya-Chun Yang,
Sung-Nien Yu

Affiliations

Yu Ping Chang: Department of Electrical Engineering, National Chung Cheng University, Chiayi County 621301, Taiwan
Ya-Chun Yang: Department of Electrical Engineering, National Chung Cheng University, Chiayi County 621301, Taiwan
Sung-Nien Yu: Department of Electrical Engineering, National Chung Cheng University, Chiayi County 621301, Taiwan

DOI: https://doi.org/10.3390/info15120750
Journal volume & issue: Vol. 15, no. 12
p. 750

Abstract

Read online

This study tackles the challenges in computer-aided prognosis for glioblastoma multiforme, a highly aggressive brain cancer, using only whole slide images (WSIs) as input. Unlike traditional methods that rely on random selection or region-of-interest (ROI) extraction to choose meaningful subsets of patches representing the whole slide, we propose a multiple instance bagging approach. This method utilizes all patches extracted from the whole slide, employing different subsets in each training epoch, thereby leveraging information from the entire slide while keeping the training computationally feasible. Additionally, we developed a two-stage framework based on the ResNet-CBAM model which estimates not just the usual survival risk, but also predicts the actual survival time. Using risk scores of patches estimated from the risk estimation stage, a risk histogram can be constructed and used as input to train a survival time prediction model. A censor hinge loss based on root mean square error was also developed to handle censored data when training the regression model. Tests using the Cancer Genome Atlas Program’s glioblastoma public database yielded a concordance index of 73.16±2.15%, surpassing existing models. Log-rank testing on predicted high- and low-risk groups using the Kaplan–Meier method revealed a p-value of 3.88×10−9, well below the usual threshold of 0.005, indicating the model’s ability to significantly differentiate between the two groups. We also implemented a heatmap visualization method that provides interpretable risk assessments at the patch level, potentially aiding clinicians in identifying high-risk regions within WSIs. Notably, these results were achieved using 98% fewer parameters compared to state-of-the-art models.

Published in Information

ISSN: 2078-2489 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/information/

About the journal

Abstract

Keywords