Metabolites (Aug 2019)

WiPP: Workflow for Improved Peak Picking for Gas Chromatography-Mass Spectrometry (GC-MS) Data

  • Nico Borgsmüller,
  • Yoann Gloaguen,
  • Tobias Opialla,
  • Eric Blanc,
  • Emilie Sicard,
  • Anne-Lise Royer,
  • Bruno Le Bizec,
  • Stéphanie Durand,
  • Carole Migné,
  • Mélanie Pétéra,
  • Estelle Pujos-Guillot,
  • Franck Giacomoni,
  • Yann Guitton,
  • Dieter Beule,
  • Jennifer Kirwan

DOI
https://doi.org/10.3390/metabo9090171
Journal volume & issue
Vol. 9, no. 9
p. 171

Abstract

Read online

Lack of reliable peak detection impedes automated analysis of large-scale gas chromatography-mass spectrometry (GC-MS) metabolomics datasets. Performance and outcome of individual peak-picking algorithms can differ widely depending on both algorithmic approach and parameters, as well as data acquisition method. Therefore, comparing and contrasting between algorithms is difficult. Here we present a workflow for improved peak picking (WiPP), a parameter optimising, multi-algorithm peak detection for GC-MS metabolomics. WiPP evaluates the quality of detected peaks using a machine learning-based classification scheme based on seven peak classes. The quality information returned by the classifier for each individual peak is merged with results from different peak detection algorithms to create one final high-quality peak set for immediate down-stream analysis. Medium- and low-quality peaks are kept for further inspection. By applying WiPP to standard compound mixes and a complex biological dataset, we demonstrate that peak detection is improved through the novel way to assign peak quality, an automated parameter optimisation, and results in integration across different embedded peak picking algorithms. Furthermore, our approach can provide an impartial performance comparison of different peak picking algorithms. WiPP is freely available on GitHub (https://github.com/bihealth/WiPP) under MIT licence.

Keywords