Multi-modal co-learning with attention mechanism for head and neck tumor segmentation on 18FDG PET-CT

Min Jeong Cho; Donghwi Hwang; Si Young Yie; Jae Sung Lee

doi:10.1186/s40658-024-00670-y

EJNMMI Physics (Jul 2024)

Multi-modal co-learning with attention mechanism for head and neck tumor segmentation on 18FDG PET-CT

Min Jeong Cho,
Donghwi Hwang,
Si Young Yie,
Jae Sung Lee

Affiliations

Min Jeong Cho: Interdisciplinary Program in Bioengineering, Seoul National University College of Engineering
Donghwi Hwang: Department of Nuclear Medicine, Seoul National University College of Medicine
Si Young Yie: Interdisciplinary Program in Bioengineering, Seoul National University College of Engineering
Jae Sung Lee: Interdisciplinary Program in Bioengineering, Seoul National University College of Engineering

DOI: https://doi.org/10.1186/s40658-024-00670-y
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Purpose Effective radiation therapy requires accurate segmentation of head and neck cancer, one of the most common types of cancer. With the advancement of deep learning, people have come up with various methods that use positron emission tomography-computed tomography to get complementary information. However, these approaches are computationally expensive because of the separation of feature extraction and fusion functions and do not make use of the high sensitivity of PET. We propose a new deep learning-based approach to alleviate these challenges. Methods We proposed a tumor region attention module that fully exploits the high sensitivity of PET and designed a network that learns the correlation between the PET and CT features using squeeze-and-excitation normalization (SE Norm) without separating the feature extraction and fusion functions. In addition, we introduce multi-scale context fusion, which exploits contextual information from different scales. Results The HECKTOR challenge 2021 dataset was used for training and testing. The proposed model outperformed the state-of-the-art models for medical image segmentation; in particular, the dice similarity coefficient increased by 8.78% compared to U-net. Conclusion The proposed network segmented the complex shape of the tumor better than the state-of-the-art medical image segmentation methods, accurately distinguishing between tumor and non-tumor regions.

Published in EJNMMI Physics

ISSN: 2197-7364 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Medical physics. Medical radiology. Nuclear medicine
Website: https://ejnmmiphys.springeropen.com/

About the journal

Abstract

Keywords