Nature Communications (Mar 2025)

General lightweight framework for vision foundation model supporting multi-task and multi-center medical image analysis

  • Senliang Lu,
  • Yehang Chen,
  • Yuan Chen,
  • Peijun Li,
  • Junqi Sun,
  • Changye Zheng,
  • Yujian Zou,
  • Bo Liang,
  • Mingwei Li,
  • Qinggeng Jin,
  • Enming Cui,
  • Wansheng Long,
  • Bao Feng

DOI
https://doi.org/10.1038/s41467-025-57427-z
Journal volume & issue
Vol. 16, no. 1
pp. 1 – 16

Abstract

Foundation models, trained on extensive and diverse datasets, have shown strong performance across numerous downstream tasks. Nevertheless, their application in the medical domain is significantly hindered by issues of data volume, data heterogeneity, and privacy. We therefore propose the Vision Foundation Model General Lightweight (VFMGL) framework, which facilitates the decentralized construction of expert clinical models for various medical tasks. VFMGL transfers general knowledge from large-parameter vision foundation models to construct lightweight, robust expert clinical models tailored to specific medical tasks. Through extensive experiments and analyses across a range of medical tasks and scenarios, we demonstrate that VFMGL achieves superior performance in both medical image classification and segmentation tasks while effectively managing the challenges posed by data heterogeneity. These results underscore the potential of VFMGL to advance the efficacy and reliability of AI-driven medical diagnostics.