BioData Mining (Apr 2009)

Statistical quality assessment and outlier detection for liquid chromatography-mass spectrometry experiments

  • Schulz-Trieglaff Ole,
  • Machtejevas Egidijus,
  • Reinert Knut,
  • Schlüter Hartmut,
  • Thiemann Joachim,
  • Unger Klaus

DOI
https://doi.org/10.1186/1756-0381-2-4
Journal volume & issue
Vol. 2, no. 1
p. 4

Abstract

Read online

Abstract Background Quality assessment methods, that are common place in engineering and industrial production, are not widely spread in large-scale proteomics experiments. But modern technologies such as Multi-Dimensional Liquid Chromatography coupled to Mass Spectrometry (LC-MS) produce large quantities of proteomic data. These data are prone to measurement errors and reproducibility problems such that an automatic quality assessment and control become increasingly important. Results We propose a methodology to assess the quality and reproducibility of data generated in quantitative LC-MS experiments. We introduce quality descriptors that capture different aspects of the quality and reproducibility of LC-MS data sets. Our method is based on the Mahalanobis distance and a robust Principal Component Analysis. Conclusion We evaluate our approach on several data sets of different complexities and show that we are able to precisely detect LC-MS runs of poor signal quality in large-scale studies.