Molecules (Apr 2021)

Statistics of the Popularity of Chemical Compounds in Relation to the Non-Target Analysis

  • Boris L. Milman
  • Inna K. Zhurkovich

DOI
https://doi.org/10.3390/molecules26082394
Journal volume & issue
Vol. 26, no. 8
p. 2394

Abstract

The idea of popularity/abundance of chemical compounds is widely used in non-target chemical analysis involving environmental studies. To give this idea a clear quantitative basis, frequency distributions of chemical compounds over indicators of their popularity/abundance are obtained and discussed. The popularity indicators are the number of information sources, the number of chemical vendors, counts of data records, and other variables assessed from two large databases, namely ChemSpider and PubChem. The distributions are approximated by power functions, special cases of Zipf distributions, which are characteristic of the results of human/social activity. A relatively small group of the most popular compounds has been identified, conventionally accounting for a few percent (several million) of all compounds. These compounds are the ones most often explored in scientific research and used in practice. Accordingly, popular compounds are taken into account as the first analyte candidates for identification in non-target analysis.
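
As a rough illustration of what approximating a distribution by a power function can look like, the sketch below fits N(x) = c * x^(-a) by ordinary least squares on log-log axes. This is only one simple way to do such a fit and is not claimed to be the authors' procedure. The function name fit_power_law, the exponent 2.0, the prefactor 1e6, and the data points are all hypothetical, chosen for illustration and not taken from ChemSpider, PubChem, or the paper.

```python
import math

def fit_power_law(xs, ys):
    """Least-squares fit of y = c * x**(-a) on log-log axes; returns (a, c)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxx = sum((u - mean_x) ** 2 for u in lx)
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(lx, ly))
    slope = sxy / sxx                      # slope of log y vs. log x
    intercept = mean_y - slope * mean_x    # log c
    return -slope, math.exp(intercept)

# Synthetic, illustrative data only (assumption): the number of compounds N
# having x information sources, generated exactly as N = 1e6 * x**-2.
xs = [1, 2, 5, 10, 20, 50, 100]
ys = [1e6 * x ** -2.0 for x in xs]

a, c = fit_power_law(xs, ys)
print(f"exponent a = {a:.3f}, prefactor c = {c:.3g}")
```

On real counts, the tail of such a distribution is usually noisy, so a straight-line fit in log-log space is at best a first approximation.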

Keywords