Axioms (Aug 2024)
A Robust High-Dimensional Test for Two-Sample Comparisons
Abstract
The Hotelling T2 statistic is used to compare the mean vectors of two independent multivariate Gaussian distributions. Nevertheless, this statistic is highly sensitive to outliers and is not suitable for high-dimensional datasets where the number of variables exceeds the sample size. This study introduces a robust permutation test based on the minimum regularized covariance determinant (MRCD) estimator to address these limitations of the two-sample Hotelling T2 statistic. Simulation studies were performed to evaluate the proposed test’s empirical size, power, and robustness. Additionally, the test was applied to both uncontaminated and contaminated Alzheimer’s Disease datasets. The findings from the simulations and real data examples provide clues that the proposed test can be effectively used with high-dimensional data without being impacted by outliers. Finally, an R function within the “MVTests” package was developed to implement the proposed test statistic on real-world data.
Keywords