Development of deep learning model for diagnosing muscle-invasive bladder cancer on MRI with vision transformer
Yasuhisa Kurata,
Mizuho Nishio,
Yusaku Moribata,
Satoshi Otani,
Yuki Himoto,
Satoru Takahashi,
Jiro Kusakabe,
Ryota Okura,
Marina Shimizu,
Keisuke Hidaka,
Naoko Nishio,
Akihiko Furuta,
Aki Kido,
Kimihiko Masui,
Hiroyuki Onishi,
Takehiko Segawa,
Takashi Kobayashi,
Yuji Nakamoto
Affiliations
Yasuhisa Kurata
Department of Diagnostic Imaging and Nuclear Medicine, Graduate School of Medicine, Kyoto University, 54, Shogoin Kawahara-cho, Sakyo-Ku, Kyoto, 606-8507, Japan; Corresponding author. Department of Diagnostic Imaging and Nuclear Medicine, Graduate School of Medicine, Kyoto University, 54 Shogoin Kawahara-cho, Sakyo-Ku, Kyoto, 606-8507, Japan.
Mizuho Nishio
Department of Diagnostic Imaging and Nuclear Medicine, Graduate School of Medicine, Kyoto University, 54, Shogoin Kawahara-cho, Sakyo-Ku, Kyoto, 606-8507, Japan
Yusaku Moribata
Department of Radiology, Shiga General Hospital, 4-30, Moriyama 5-chome, Moriyama-shi, Shiga, 524-8524, Japan
Satoshi Otani
Department of Radiology, Kyoto City Hospital, 2-1 Mibu Higashi Takada-cho Nakagyo-ku, Kyoto, 604-8845, Japan
Yuki Himoto
Department of Diagnostic Imaging and Nuclear Medicine, Graduate School of Medicine, Kyoto University, 54, Shogoin Kawahara-cho, Sakyo-Ku, Kyoto, 606-8507, Japan
Satoru Takahashi
Department of Radiology, Takatsuki General Hospital, 1-3-13, Kosobe-Cho, Takatsuki-Shi, Osaka, 569-1192, Japan
Jiro Kusakabe
Department of General Surgery, Cleveland Clinic, Cleveland, OH, USA
Ryota Okura
Department of Radiology, Kyoto City Hospital, 2-1 Mibu Higashi Takada-cho Nakagyo-ku, Kyoto, 604-8845, Japan
Marina Shimizu
Department of Radiology, Kyoto City Hospital, 2-1 Mibu Higashi Takada-cho Nakagyo-ku, Kyoto, 604-8845, Japan
Keisuke Hidaka
Department of Radiology, Osaka Red Cross Hospital, 5-30 Fudegasakicho, Tennoji-ku, Osaka, 543-0027, Japan
Naoko Nishio
Department of Radiology, Osaka Red Cross Hospital, 5-30 Fudegasakicho, Tennoji-ku, Osaka, 543-0027, Japan
Akihiko Furuta
Department of Radiology, Osaka Red Cross Hospital, 5-30 Fudegasakicho, Tennoji-ku, Osaka, 543-0027, Japan
Aki Kido
Department of Diagnostic Imaging and Nuclear Medicine, Graduate School of Medicine, Kyoto University, 54, Shogoin Kawahara-cho, Sakyo-Ku, Kyoto, 606-8507, Japan
Kimihiko Masui
Department of Urology, Kyoto University Hospital, 54 Shogoin Kawahara-cho, Sakyo-Ku, Kyoto, 606-8507, Japan
Hiroyuki Onishi
Department of Urology, Osaka Red Cross Hospital, 5-30 Fudegasakicho, Tennoji-ku, Osaka, 543-0027, Japan
Takehiko Segawa
Department of Urology, Kyoto City Hospital, 2-1 Mibu Higashi Takada-cho Nakagyo-ku, Kyoto, 604-8845, Japan
Takashi Kobayashi
Department of Urology, Kyoto University Hospital, 54 Shogoin Kawahara-cho, Sakyo-Ku, Kyoto, 606-8507, Japan
Yuji Nakamoto
Department of Diagnostic Imaging and Nuclear Medicine, Graduate School of Medicine, Kyoto University, 54, Shogoin Kawahara-cho, Sakyo-Ku, Kyoto, 606-8507, Japan
Rationale and objectives: To develop and validate a deep learning (DL) model to automatically diagnose muscle-invasive bladder cancer (MIBC) on MRI with Vision Transformer (ViT). Materials and methods: This multicenter retrospective study included patients with BC who reported to two institutions between January 2016 and June 2020 (training dataset) and a third institution between May 2017 and May 2022 (test dataset). The diagnostic model for MIBC and the segmentation model for BC on MRI were developed using the training dataset with 5-fold cross-validation. ViT- and convolutional neural network (CNN)-based diagnostic models were developed and compared for diagnostic performance using the area under the curve (AUC). The performance of the diagnostic model with manual and auto-generated regions of interest (ROImanual and ROIauto, respectively) was validated on the test dataset and compared to that of radiologists (three senior and three junior radiologists) using Vesical Imaging Reporting and Data System scoring. Results: The training and test datasets included 170 and 53 patients, respectively. Mean AUC of the top 10 ViT-based models with 5-fold cross-validation outperformed those of the CNN-based models (0.831 ± 0.003 vs. 0.713 ± 0.007–0.812 ± 0.006, p < .001). The diagnostic model with ROImanual achieved AUC of 0.872 (95 % CI: 0.777, 0.968), which was comparable to that of junior radiologists (AUC = 0.862, 0.873, and 0.930). Semi-automated diagnosis with the diagnostic model with ROIauto achieved AUC of 0.815 (95 % CI: 0.696, 0.935). Conclusion: The DL model effectively diagnosed MIBC. The ViT-based model outperformed CNN-based models, highlighting its utility in medical image analysis.