LipidMatch: an automated workflow for rule-based lipid identification using untargeted high-resolution tandem mass spectrometry data

Jeremy P. Koelmel; Nicholas M. Kroeger; Candice Z. Ulmer; John A. Bowden; Rainey E. Patterson; Jason A. Cochran; Christopher W. W. Beecher; Timothy J. Garrett; Richard A. Yost

doi:10.1186/s12859-017-1744-3

BMC Bioinformatics (Jul 2017)

LipidMatch: an automated workflow for rule-based lipid identification using untargeted high-resolution tandem mass spectrometry data

Jeremy P. Koelmel,
Nicholas M. Kroeger,
Candice Z. Ulmer,
John A. Bowden,
Rainey E. Patterson,
Jason A. Cochran,
Christopher W. W. Beecher,
Timothy J. Garrett,
Richard A. Yost

Affiliations

Jeremy P. Koelmel: Department of Chemistry, University of Florida
Nicholas M. Kroeger: College of Engineering, University of Florida, 412
Candice Z. Ulmer: Department of Chemistry, University of Florida
John A. Bowden: National Institute of Standards and Technology, Hollings Marine Laboratory
Rainey E. Patterson: Department of Chemistry, University of Florida
Jason A. Cochran: College of Engineering, University of Florida, 412
Christopher W. W. Beecher: Clinical and Translational Science Institute, University of Florida
Timothy J. Garrett: Department of Chemistry, University of Florida
Richard A. Yost: Department of Chemistry, University of Florida

DOI: https://doi.org/10.1186/s12859-017-1744-3
Journal volume & issue: Vol. 18, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background Lipids are ubiquitous and serve numerous biological functions; thus lipids have been shown to have great potential as candidates for elucidating biomarkers and pathway perturbations associated with disease. Methods expanding coverage of the lipidome increase the likelihood of biomarker discovery and could lead to more comprehensive understanding of disease etiology. Results We introduce LipidMatch, an R-based tool for lipid identification for liquid chromatography tandem mass spectrometry workflows. LipidMatch currently has over 250,000 lipid species spanning 56 lipid types contained in in silico fragmentation libraries. Unique fragmentation libraries, compared to other open source software, include oxidized lipids, bile acids, sphingosines, and previously uncharacterized adducts, including ammoniated cardiolipins. LipidMatch uses rule-based identification. For each lipid type, the user can select which fragments must be observed for identification. Rule-based identification allows for correct annotation of lipids based on the fragments observed, unlike typical identification based solely on spectral similarity scores, where over-reporting structural details that are not conferred by fragmentation data is common. Another unique feature of LipidMatch is ranking lipid identifications for a given feature by the sum of fragment intensities. For each lipid candidate, the intensities of experimental fragments with exact mass matches to expected in silico fragments are summed. The lipid identifications with the greatest summed intensity using this ranking algorithm were comparable to other lipid identification software annotations, MS-DIAL and Greazy. For example, for features with identifications from all 3 software, 92% of LipidMatch identifications by fatty acyl constituents were corroborated by at least one other software in positive mode and 98% in negative ion mode. Conclusions LipidMatch allows users to annotate lipids across a wide range of high resolution tandem mass spectrometry experiments, including imaging experiments, direct infusion experiments, and experiments employing liquid chromatography. LipidMatch leverages the most extensive in silico fragmentation libraries of freely available software. When integrated into a larger lipidomics workflow, LipidMatch may increase the probability of finding lipid-based biomarkers and determining etiology of disease by covering a greater portion of the lipidome and using annotation which does not over-report biologically relevant structural details of identified lipid molecules.

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/

About the journal

Abstract

Keywords