Bioengineering (Jan 2023)
Collinearity and Dimensionality Reduction in Radiomics: Effect of Preprocessing Parameters in Hypertrophic Cardiomyopathy Magnetic Resonance T1 and T2 Mapping
Abstract
Radiomics and artificial intelligence have the potential to become valuable tools in clinical applications. Radiomic analyses based on machine learning methods frequently suffer from high dimensionality and multicollinearity, and redundant radiomic features are usually removed based on correlation analysis. We assessed the effect of preprocessing (voxel size resampling, discretization, and filtering) on correlation-based dimensionality reduction in radiomic features from cardiac T1 and T2 maps of patients with hypertrophic cardiomyopathy. For different combinations of preprocessing parameters, we performed a dimensionality reduction of radiomic features based on either Pearson’s or Spearman’s correlation coefficient, followed by the computation of the stability index. With varying resampling voxel size and discretization bin width, for both T1 and T2 maps, Pearson’s and Spearman’s dimensionality reduction produced slightly different percentages of remaining radiomic features, with a relatively high stability index. Across different filters, by contrast, the stability of the remaining features was relatively low. Overall, the percentage of radiomic features eliminated through correlation-based dimensionality reduction depended more on resampling voxel size and discretization bin width for textural features than for shape or first-order features. Notably, correlation-based dimensionality reduction was less sensitive to preprocessing for radiomic features from T2 maps than for those from T1 maps.
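As an illustration of the correlation-based dimensionality reduction described above, the following is a minimal sketch and not the authors' implementation. It makes several assumptions that the abstract does not state: a greedy elimination order, a cut-off of 0.9 on the absolute correlation, and the Jaccard index as a simple stand-in for the stability index, whose exact definition is not given here. The feature names and values are toy data invented for the example.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson's correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            result[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return result


def spearman(x, y):
    """Spearman's coefficient: Pearson's coefficient computed on ranks."""
    return pearson(ranks(x), ranks(y))


def reduce_features(features, corr, threshold=0.9):
    """Greedily keep a feature only if its absolute correlation with every
    already-kept feature is below the threshold (assumed value: 0.9)."""
    kept = []
    for name, values in features.items():
        if all(abs(corr(values, features[k])) < threshold for k in kept):
            kept.append(name)
    return kept


def jaccard(a, b):
    """Overlap of two feature subsets; an assumed stand-in for stability."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 1.0


# Toy data: one list of hypothetical feature values per feature, one value per patient.
features = {
    "mean":    [1.0, 2.0, 3.0, 4.0, 5.0],
    "median":  [1.1, 2.1, 2.9, 4.2, 5.0],    # nearly linear in "mean"
    "skewed":  [1.0, 2.0, 3.0, 4.0, 100.0],  # monotonic but non-linear in "mean"
    "entropy": [2.0, 1.0, 4.0, 3.0, 0.5],
}

kept_pearson = reduce_features(features, pearson)
kept_spearman = reduce_features(features, spearman)
print(kept_pearson, kept_spearman, jaccard(kept_pearson, kept_spearman))
```

In this toy data, the "skewed" feature is monotonically but not linearly related to "mean", so it survives the Pearson-based reduction and is removed by the Spearman-based one. This is one way the two coefficients can leave different percentages of remaining features.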
Keywords