Metabolites (Jan 2023)

Benchmarking Outlier Detection Methods for Detecting IEM Patients in Untargeted Metabolomics Data

  • Michiel Bongaerts,
  • Purva Kulkarni,
  • Alan Zammit,
  • Ramon Bonte,
  • Leo A. J. Kluijtmans,
  • Henk J. Blom,
  • Udo F. H. Engelke,
  • David M. J. Tax,
  • George J. G. Ruijter,
  • Marcel J. T. Reinders

DOI
https://doi.org/10.3390/metabo13010097
Journal volume & issue
Vol. 13, no. 1
p. 97

Abstract

Read online

Untargeted metabolomics (UM) is increasingly being deployed as a strategy for screening patients that are suspected of having an inborn error of metabolism (IEM). In this study, we examined the potential of existing outlier detection methods to detect IEM patient profiles. We benchmarked 30 different outlier detection methods when applied to three untargeted metabolomics datasets. Our results show great differences in IEM detection performances across the various methods. The methods DeepSVDD and R-graph performed most consistently across the three metabolomics datasets. For datasets with a more balanced number of samples-to-features ratio, we found that AE reconstruction error, Mahalanobis and PCA reconstruction error also performed well. Furthermore, we demonstrated the importance of a PCA transform prior to applying an outlier detection method since we observed that this increases the performance of several outlier detection methods. For only one of the three metabolomics datasets, we observed clinically satisfying performances for some outlier detection methods, where we were able to detect 90% of the IEM patient samples while detecting no false positives. These results suggest that outlier detection methods have the potential to aid the clinical investigator in routine screening for IEM using untargeted metabolomics data, but also show that further improvements are needed to ensure clinically satisfying performances.

Keywords