3D bi-directional transformer U-Net for medical image segmentation

Xiyao Fu; Zhexian Sun; Haoteng Tang; Eric M. Zou; Heng Huang; Yong Wang; Yong Wang; Yong Wang; Yong Wang; Liang Zhan

doi:10.3389/fdata.2022.1080715

Frontiers in Big Data (Jan 2023)

3D bi-directional transformer U-Net for medical image segmentation

Xiyao Fu,
Zhexian Sun,
Haoteng Tang,
Eric M. Zou,
Heng Huang,
Yong Wang,
Yong Wang,
Yong Wang,
Yong Wang,
Liang Zhan

Affiliations

Xiyao Fu: Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA, United States
Zhexian Sun: Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO, United States
Haoteng Tang: Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA, United States
Eric M. Zou: Montgomery Blair High School Maryland, 51 University Blvd E, Silver Spring, MD, United States
Heng Huang: Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA, United States
Yong Wang: Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO, United States
Yong Wang: Department of Electrical and Systems Engineering, Washington University in St. Louis, St. Louis, MO, United States
Yong Wang: Department of Obstetrics and Gynecology, Washington University in St. Louis, St. Louis, MO, United States
Yong Wang: Department of Radiology, Washington University in St. Louis, St. Louis, MO, United States
Liang Zhan: Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA, United States

DOI: https://doi.org/10.3389/fdata.2022.1080715
Journal volume & issue: Vol. 5

Abstract

Read online

As one of the popular deep learning methods, deep convolutional neural networks (DCNNs) have been widely adopted in segmentation tasks and have received positive feedback. However, in segmentation tasks, DCNN-based frameworks are known for their incompetence in dealing with global relations within imaging features. Although several techniques have been proposed to enhance the global reasoning of DCNN, these models are either not able to gain satisfying performances compared with traditional fully-convolutional structures or not capable of utilizing the basic advantages of CNN-based networks (namely the ability of local reasoning). In this study, compared with current attempts to combine FCNs and global reasoning methods, we fully extracted the ability of self-attention by designing a novel attention mechanism for 3D computation and proposed a new segmentation framework (named 3DTU) for three-dimensional medical image segmentation tasks. This new framework processes images in an end-to-end manner and executes 3D computation on both the encoder side (which contains a 3D transformer) and the decoder side (which is based on a 3D DCNN). We tested our framework on two independent datasets that consist of 3D MRI and CT images. Experimental results clearly demonstrate that our method outperforms several state-of-the-art segmentation methods in various metrics.

Published in Frontiers in Big Data

ISSN: 2624-909X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://www.frontiersin.org/journals/big-data

About the journal

Abstract

Keywords