Metabolites (Nov 2021)
Coupling Mixed Mode Chromatography/ESI Negative MS Detection with Message-Passing Neural Network Modeling for Enhanced Metabolome Coverage and Structural Identification
Abstract
A key unmet need in metabolomics continues to be the specific, selective, accurate detection of traditionally difficult to retain molecules including simple sugars, sugar phosphates, carboxylic acids, and related amino acids. Designed to retain the metabolites of central carbon metabolism, this Mixed Mode (MM) chromatography applies varied pH, salt concentration and organic content to a positively charged quaternary amine polyvinyl alcohol stationary phase. This MM method is capable of separating glucose from fructose, and four hexose monophosphates a single chromatographic run. Coupled to a QExactive Orbitrap Mass Spectrometer with negative ESI, linearity, LLOD, %CV, and mass accuracy were assessed using 33 metabolite standards. The standards were linear on average >3 orders of magnitude (R2 > 0.98 for 30/33) with LLOD < 1 pmole (26/33), median CV of 12% over two weeks, and median mass accuracy of 0.49 ppm. To assess the breadth of metabolome coverage and better define the structural elements dictating elution, we injected 607 unique metabolites and determined that 398 are well retained. We then split the dataset of 398 documented RTs into training and test sets and trained a message-passing neural network (MPNN) to predict RT from a featurized heavy atom connectivity graph. Unlike traditional QSAR methods that utilize hand-crafted descriptors or pre-defined structural keys, the MPNN aggregates atomic features across the molecular graph and learns to identify molecular subgraphs that are correlated with variations in RTs. For sugars, sugar phosphates, carboxylic acids, and isomers, the model achieves a predictive RT error of <2 min on 91%, 50%, 77%, and 72% of held-out compounds from these subsets, with overall root mean square errors of 0.11, 0.34, 0.18, and 0.53 min, respectively. The model was then applied to rank order metabolite IDs for molecular features altered by GLS2 knockout in mouse primary hepatocytes.
Keywords