IEEE Access (Jan 2024)

A Multimodal Transfer Learning Approach Using PubMedCLIP for Medical Image Classification

  • Hong N. Dao,
  • Tuyen Nguyen,
  • Cherubin Mugisha,
  • Incheon Paik

DOI
https://doi.org/10.1109/ACCESS.2024.3401777
Journal volume & issue
Vol. 12
pp. 75496 – 75507

Abstract

Read online

Medical image data often face the problem of data scarcity and costly annotation processes. To overcome this, our study introduces a novel transfer learning method for medical image classification. We present a multimodal learning framework that incorporates the pre-trained PubMedCLIP model and multimodal feature fusion. Prompts of different complexities are combined with images as inputs to the proposed model. Our findings demonstrate that this approach significantly enhances image classification tasks while reducing the burden of annotation costs. Our study underscores the potential of PubMedCLIP in revolutionizing medical image analysis through its prompt-based approach and showcases the value of multi-modality for training robust models in healthcare. Code is available at:https://github.com/HongJapan/MTL_prompt_medical.git.

Keywords