Medical image analysis using improved SAM-Med2D: segmentation and classification perspectives

Jiakang Sun; Ke Chen; Zhiyi He; Siyuan Ren; Xinyang He; Xu Liu; Cheng Peng

doi:10.1186/s12880-024-01401-6

BMC Medical Imaging (Sep 2024)

Affiliations

Jiakang Sun: Chengdu Institute of Computer Application, Chinese Academy of Sciences
Ke Chen: Chengdu Institute of Computer Application, Chinese Academy of Sciences
Zhiyi He: Chengdu Institute of Computer Application, Chinese Academy of Sciences
Siyuan Ren: Chengdu Institute of Computer Application, Chinese Academy of Sciences
Xinyang He: Chengdu Institute of Computer Application, Chinese Academy of Sciences
Xu Liu: Chengdu Institute of Computer Application, Chinese Academy of Sciences
Cheng Peng: Chengdu Institute of Computer Application, Chinese Academy of Sciences

Journal volume & issue: Vol. 24, no. 1, pp. 1–17

Abstract

The recently introduced SAM-Med2D represents a state-of-the-art advance in medical image segmentation. By fine-tuning the large visual model Segment Anything Model (SAM) on extensive medical datasets, it has achieved impressive results in cross-modal medical image segmentation. However, its reliance on interactive prompts may restrict its applicability under specific conditions. To address this limitation, we introduce SAM-AutoMed, which achieves automatic segmentation of medical images by replacing the original prompt encoder with an improved MobileNet v3 backbone. Its performance on multiple datasets surpasses that of both SAM and SAM-Med2D. In addition, existing enhancements of SAM have not yet been applied to medical image classification. We therefore introduce SAM-MedCls, which combines the encoder of SAM-Med2D with our designed attention modules to construct an end-to-end medical image classification model. It performs well on datasets of various modalities, even achieving state-of-the-art results, which indicates its potential to become a universal model for medical image classification.

Published in BMC Medical Imaging

ISSN: 1471-2342 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Medical technology
Website: http://bmcmedimaging.biomedcentral.com
