IET Image Processing (May 2024)
SSA‐UNet: Whole brain segmentation by U‐Net with squeeze‐and‐excitation block and self‐attention block from the 2.5D slice image
Abstract
Abstract Whole brain segmentation from magnetic resonance images (MRI) is crucial in diagnosing brain diseases and analyzing neuroimaging data. Despite advances through deep learning, challenges such as uneven gray distribution and the presence of artifacts still present hurdles in medical image processing. These limitations are often a result of insufficient spatial contextual information and lack of attention to important regions within existing models. To address these issues, this paper presents SSA‐UNet (Squeeze‐and‐Excitation and Self‐Attention UNet), a uniquely designed deep convolutional neural network that integrates spatial constraints by converting three consecutive 2D MRI slices into a single 2.5D image. This facilitates capturing inter‐slice dependencies effectively. Additionally, the newly formulated SSA block, which sequentially incorporates channel attention and Self‐Attention mechanisms, is placed before the decoders in the conventional U‐Net architecture. This enables the network to automatically weight different feature maps and focus more effectively on regions requiring precise segmentation. Rigorous evaluations on LPBA40 and IBSR18 datasets substantiate the remarkable improvements in accuracy and stability achieved by SSA‐UNet. Results indicate Dice coefficients of 98.38% and 97.47%, specificity of 99.69% and 99.57%, and sensitivity of 98.5% and 97.98% for the respective datasets. Compared to other existing models, SSA‐UNet shows significant improvements on both the LPBA40 and IBSR18 datasets. On the LPBA40 dataset, SSA‐UNet's Dice coefficient improved by 0.33% compared to the sub‐optimal model, while on the IBSR18 dataset, the improvement reached 1.78%. These empirical findings demonstrate SSA‐UNet's heightened capability in addressing the long‐standing challenges in MRI‐based whole‐brain segmentation.
Keywords