Communications Chemistry (Feb 2024)
A spectroscopic test suggests that fragment ion structure annotations in MS/MS libraries are frequently incorrect
Abstract
Abstract Modern untargeted mass spectrometry (MS) analyses quickly detect and resolve thousands of molecular compounds. Although features are readily annotated with a molecular formula in high-resolution small-molecule MS applications, the large majority of them remains unidentified in terms of their full molecular structure. Collision-induced dissociation tandem mass spectrometry (CID-MS2) provides a diagnostic molecular fingerprint to resolve the molecular structure through a library search. However, for de novo identifications, one must often rely on in silico generated MS2 spectra as reference. The ability of different in silico algorithms to correctly predict MS2 spectra and thus to retrieve correct molecular structures is a topic of lively debate, for instance in the CASMI contest. Underlying the predicted MS2 spectra are the in silico generated product ion structures, which are normally not used in de novo identification, but which can serve to critically assess the fragmentation algorithms. Here we evaluate in silico generated MSn product ion structures by comparison with structures established experimentally by infrared ion spectroscopy (IRIS). For a set of three dozen product ion structures from five precursor molecules, we find that virtually all fragment ion structure annotations in three major in silico MS2 libraries (HMDB, METLIN, mzCloud) are incorrect and caution the reader against their use for structure annotation of MS/MS ions.