Machine Learning: Science and Technology (Jan 2025)

FDDM: unsupervised medical image translation with a frequency-decoupled diffusion model

  • Yunxiang Li,
  • Hua-Chieh Shao,
  • Xiaoxue Qian,
  • You Zhang

DOI
https://doi.org/10.1088/2632-2153/adc656
Journal volume & issue
Vol. 6, no. 2
p. 025007

Abstract

Read online

Diffusion models have demonstrated significant potential in producing high-quality images in medical image translation to aid disease diagnosis, localization, and treatment. Nevertheless, current diffusion models often fall short when it comes to faithfully translating medical images. They struggle to accurately preserve anatomical structures, especially when working with unpaired datasets. In this study, we introduce the frequency decoupled diffusion model (FDDM) for magnetic resonance (MR)-to-computed tomography (CT) conversion. The differences between MR and CT images lie in both anatomical structures (e.g. the outlines of organs or bones) and the data distribution (e.g. intensity values and contrast within). Therefore, FDDM first converts anatomical information using an initial conversion module. Then, the converted anatomical information guides a subsequent diffusion model to generate high-quality CT images. Our diffusion model uses a dual-path reverse diffusion process for low-frequency and high-frequency information, achieving a better balance between image quality and anatomical accuracy. We extensively evaluated FDDM using two public datasets for brain MR-to-CT and pelvis MR-to-CT translations. The results show that FDDM outperforms generative adversarial network (GAN)-based, variational autoencoder (VAE)-based, and diffusion-based models. The evaluation metrics included Fréchet inception distance (FID), mean absolute error, mean squared error, structural similarity index measure, and Dice similarity coefficient (DICE). FDDM achieved the best scores on all metrics for both datasets, particularly excelling in FID, with scores of 25.9 for brain data and 29.2 for pelvis data, significantly outperforming the other methods. These results demonstrate that FDDM can generate high-quality target domain images while maintaining the accuracy of translated anatomical structures, thereby facilitating more precise/accurate downstream tasks including anatomy segmentation and radiotherapy planning.

Keywords