BMC Medical Imaging (Sep 2024)

Medical image analysis using improved SAM-Med2D: segmentation and classification perspectives

  • Jiakang Sun,
  • Ke Chen,
  • Zhiyi He,
  • Siyuan Ren,
  • Xinyang He,
  • Xu Liu,
  • Cheng Peng

DOI
https://doi.org/10.1186/s12880-024-01401-6
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 17

Abstract

Read online

Abstract Recently emerged SAM-Med2D represents a state-of-the-art advancement in medical image segmentation. Through fine-tuning the Large Visual Model, Segment Anything Model (SAM), on extensive medical datasets, it has achieved impressive results in cross-modal medical image segmentation. However, its reliance on interactive prompts may restrict its applicability under specific conditions. To address this limitation, we introduce SAM-AutoMed, which achieves automatic segmentation of medical images by replacing the original prompt encoder with an improved MobileNet v3 backbone. The performance on multiple datasets surpasses both SAM and SAM-Med2D. Current enhancements on the Large Visual Model SAM lack applications in the field of medical image classification. Therefore, we introduce SAM-MedCls, which combines the encoder of SAM-Med2D with our designed attention modules to construct an end-to-end medical image classification model. It performs well on datasets of various modalities, even achieving state-of-the-art results, indicating its potential to become a universal model for medical image classification.

Keywords