Patterns (Feb 2021)

Machine-learning-enhanced time-of-flight mass spectrometry analysis

  • Ye Wei,
  • Rama Srinivas Varanasi,
  • Torsten Schwarz,
  • Leonie Gomell,
  • Huan Zhao,
  • David J. Larson,
  • Binhan Sun,
  • Geng Liu,
  • Hao Chen,
  • Dierk Raabe,
  • Baptiste Gault

Journal volume & issue
Vol. 2, no. 2
p. 100192

Abstract

Read online

Summary: Mass spectrometry is a widespread approach used to work out what the constituents of a material are. Atoms and molecules are removed from the material and collected, and subsequently, a critical step is to infer their correct identities based on patterns formed in their mass-to-charge ratios and relative isotopic abundances. However, this identification step still mainly relies on individual users' expertise, making its standardization challenging, and hindering efficient data processing. Here, we introduce an approach that leverages modern machine learning technique to identify peak patterns in time-of-flight mass spectra within microseconds, outperforming human users without loss of accuracy. Our approach is cross-validated on mass spectra generated from different time-of-flight mass spectrometry (ToF-MS) techniques, offering the ToF-MS community an open-source, intelligent mass spectra analysis. The bigger picture: Time-of-flight mass spectrometry (ToF-MS) is a mainstream analytical technique widely used in biology, chemistry, and materials science. ToF-MS provides quantitative compositional analysis with high sensitivity across a wide dynamic range of mass-to-charge ratios. A critical step in ToF-MS is to infer the identity of the detected ions. Here, we introduce a machine-learning-enhanced algorithm to provide a user-independent approach to performing this identification using patterns from the natural isotopic abundances of individual atomic and molecular ions, without human labeling or prior knowledge of composition. Results from several materials and techniques are compared with those obtained by field experts. Our open-source, easy-to-implement, reliable analytic method accelerates this identification process. A wide range of ToF-MS-based applications can benefit from our approach, e.g., hunting for patterns of biomarkers or for contamination on solid surfaces in high-throughput data.

Keywords