Scientific Reports (Oct 2023)
Comparative analysis of batch correction methods for FDG PET/CT using metabolic radiogenomic data of lung cancer patients
Abstract
Abstract In radiomics research, the issue of different instruments being used is significant. In this study, we compared three correction methods to reduce the batch effects in radiogenomic data from fluorodeoxyglucose (FDG) PET/CT images of lung cancer patients. Texture features of the FDG PET/CT images and genomic data were retrospectively obtained. The features were corrected with different methods: phantom correction, ComBat method, and Limma method. Batch effects were estimated using three analytic tools: principal component analysis (PCA), the k-nearest neighbor batch effect test (kBET), and the silhouette score. Finally, the associations of features and gene mutations were compared between each correction method. Although the kBET rejection rate and silhouette score were lower in the phantom-corrected data than in the uncorrected data, a PCA plot showed a similar variance. ComBat and Limma methods provided correction with low batch effects, and there was no significant difference in the results of the two methods. In ComBat- and Limma-corrected data, more texture features exhibited a significant association with the TP53 mutation than in those in the phantom-corrected data. This study suggests that correction with ComBat or Limma methods can be more effective or equally as effective as the phantom method in reducing batch effects.