Nature Communications (Dec 2023)
Open access repository-scale propagated nearest neighbor suspect spectral library for untargeted metabolomics
- Wout Bittremieux,
- Nicole E. Avalon,
- Sydney P. Thomas,
- Sarvar A. Kakhkhorov,
- Alexander A. Aksenov,
- Paulo Wender P. Gomes,
- Christine M. Aceves,
- Andrés Mauricio Caraballo-Rodríguez,
- Julia M. Gauglitz,
- William H. Gerwick,
- Tao Huan,
- Alan K. Jarmusch,
- Rima F. Kaddurah-Daouk,
- Kyo Bin Kang,
- Hyun Woo Kim,
- Todor Kondić,
- Helena Mannochio-Russo,
- Michael J. Meehan,
- Alexey V. Melnik,
- Louis-Felix Nothias,
- Claire O’Donovan,
- Morgan Panitchpakdi,
- Daniel Petras,
- Robin Schmid,
- Emma L. Schymanski,
- Justin J. J. van der Hooft,
- Kelly C. Weldon,
- Heejung Yang,
- Shipei Xing,
- Jasmine Zemlin,
- Mingxun Wang,
- Pieter C. Dorrestein
Affiliations
- Wout Bittremieux
- Department of Computer Science, University of Antwerp
- Nicole E. Avalon
- Scripps Institution of Oceanography, University of California San Diego
- Sydney P. Thomas
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Sarvar A. Kakhkhorov
- Laboratory of Physical and Chemical Methods of Research, Center for Advanced Technologies
- Alexander A. Aksenov
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Paulo Wender P. Gomes
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Christine M. Aceves
- Department of Immunology and Microbiology, The Scripps Research Institute
- Andrés Mauricio Caraballo-Rodríguez
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Julia M. Gauglitz
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- William H. Gerwick
- Scripps Institution of Oceanography, University of California San Diego
- Tao Huan
- Department of Chemistry, University of British Columbia
- Alan K. Jarmusch
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Rima F. Kaddurah-Daouk
- Department of Psychiatry and Behavioral Sciences, Duke University School of Medicine
- Kyo Bin Kang
- College of Pharmacy and Research Institute of Pharmaceutical Sciences, Sookmyung Women’s University
- Hyun Woo Kim
- College of Pharmacy and Integrated Research Institute for Drug Development, Dongguk University
- Todor Kondić
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg
- Helena Mannochio-Russo
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Michael J. Meehan
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Alexey V. Melnik
- Department of Chemistry, University of Connecticut
- Louis-Felix Nothias
- Université Côte d’Azur, CNRS, ICN
- Claire O’Donovan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus
- Morgan Panitchpakdi
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Daniel Petras
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Robin Schmid
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Emma L. Schymanski
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg
- Justin J. J. van der Hooft
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego
- Kelly C. Weldon
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Heejung Yang
- Laboratory of Natural Products Chemistry, College of Pharmacy, Kangwon National University
- Shipei Xing
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Jasmine Zemlin
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- Mingxun Wang
- Department of Computer Science and Engineering, University of California Riverside
- Pieter C. Dorrestein
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego
- DOI
- https://doi.org/10.1038/s41467-023-44035-y
- Journal volume & issue
-
Vol. 14,
no. 1
pp. 1 – 15
Abstract
Abstract Despite the increasing availability of tandem mass spectrometry (MS/MS) community spectral libraries for untargeted metabolomics over the past decade, the majority of acquired MS/MS spectra remain uninterpreted. To further aid in interpreting unannotated spectra, we created a nearest neighbor suspect spectral library, consisting of 87,916 annotated MS/MS spectra derived from hundreds of millions of MS/MS spectra originating from published untargeted metabolomics experiments. Entries in this library, or “suspects,” were derived from unannotated spectra that could be linked in a molecular network to an annotated spectrum. Annotations were propagated to unknowns based on structural relationships to reference molecules using MS/MS-based spectrum alignment. We demonstrate the broad relevance of the nearest neighbor suspect spectral library through representative examples of propagation-based annotation of acylcarnitines, bacterial and plant natural products, and drug metabolism. Our results also highlight how the library can help to better understand an Alzheimer’s brain phenotype. The nearest neighbor suspect spectral library is openly available for download or for data analysis through the GNPS platform to help investigators hypothesize candidate structures for unknown MS/MS spectra in untargeted metabolomics data.