Journal of Chromatography Open (Nov 2023)

Region of interest selection for GC×GC–MS data using a pseudo fisher ratio moving window with connected components segmentation

  • Ryland T. Giebelhaus,
  • A. Paulina de la Mata,
  • James J. Harynuk

Journal volume & issue
Vol. 4
p. 100104

Abstract

Read online

Comprehensive two-dimensional gas chromatography mass spectrometry (GC×GC–MS) data present several challenges for analysis largely because chemical factors drift along the chromatographic modes across different chromatographic runs, and there is frequently a lack of reliable molecular ion measurements with which to align data across multiple samples. Tensor decomposition techniques such as Parallel Factor Analysis (PARAFAC2/PARAFAC2×N) allow analysts to deconvolve closely eluting signals for quantitative and qualitative purposes. These techniques make relatively few assumptions about chromatographic peak shapes or the relative abundance of noise and allow for highly accurate representations of the underlying chemical phenomena using well-characterized and scrutinized principles of chemometrics. However, expert intervention and supervision is required to select appropriate Regions of Interest (ROI) and numbers of chemical components present in each ROI. We previously reported an automated ROI selection algorithm for GC–MS data in Giebelhaus et al. where we observed the ratio of the first and second eigenvalues within a moving window across the entire chromatogram. Here, we present an extension of this work to automatically detect ROIs in GC×GC–MS chromatograms, while making no assumptions about peak shape. First, we calculate the probabilities of each acquisition being in a ROI, then apply connected components segmentation to discretize the regions of interest. For sparse chromatograms we found the algorithm detected spurious peaks. To address this, we implemented an iterative ROI selection process where we autoscaled the moving window to the standard deviation of the noise from the previous iteration. Using three user-defined parameters, we generated informative ROIs on a wide range of GC×GC-TOFMS chromatograms.

Keywords