Machine Learning: Science and Technology (Jan 2024)

Generation model meets swin transformer for unsupervised low-dose CT reconstruction

  • Yu Li,
  • Xueqin Sun,
  • Sukai Wang,
  • Yingwei Qin,
  • Jinxiao Pan,
  • Ping Chen

DOI
https://doi.org/10.1088/2632-2153/ad370e
Journal volume & issue
Vol. 5, no. 2
p. 025005

Abstract

Read online

Computed tomography (CT) has evolved into an indispensable tool for clinical diagnosis. Reducing radiation dose crucially minimizes adverse effects but may introduce noise and artifacts in reconstructed images, affecting diagnostic processes for physicians. Scholars have tackled deep learning training instability by exploring diffusion models. Given the scarcity of clinical data, we propose the unsupervised image domain score generation model (UISG) for low-dose CT reconstruction. During training, normal-dose CT images are utilized as network inputs to train a score-based generative model that captures the prior distribution of CT images. In the iterative reconstruction, the initial CT image is obtained using a filtered back-projection algorithm. Subsequently, diffusion-based prior, high-frequency convolutional sparse coding prior, and data-consistency steps are employed to obtain the high-quality reconstructed image. Given the global characteristics of noise, the score network of the diffusion model utilizes a swin transformer structure to enhance the model’s ability to capture long-range dependencies. Furthermore, convolutional sparse coding is applied exclusively to the high-frequency components of the image, to prevent over-smoothing or the loss of crucial anatomical details during the denoising process. Quantitative and qualitative results indicate that UISG outperforms competing methods in terms of denoising and generalization performance.

Keywords