Frontiers in Pharmacology (Jul 2021)

A Comparison of Nine Machine Learning Mutagenicity Models and Their Application for Predicting Pyrrolizidine Alkaloids

  • Christoph Helma,
  • Verena Schöning,
  • Jürgen Drewe,
  • Jürgen Drewe,
  • Philipp Boss

DOI
https://doi.org/10.3389/fphar.2021.708050
Journal volume & issue
Vol. 12

Abstract

Read online

Random forest, support vector machine, logistic regression, neural networks and k-nearest neighbor (lazar) algorithms, were applied to a new Salmonella mutagenicity dataset with 8,290 unique chemical structures utilizing MolPrint2D and Chemistry Development Kit (CDK) descriptors. Crossvalidation accuracies of all investigated models ranged from 80 to 85% which is comparable with the interlaboratory variability of the Salmonella mutagenicity assay. Pyrrolizidine alkaloid predictions showed a clear distinction between chemical groups, where otonecines had the highest proportion of positive mutagenicity predictions and monoesters the lowest.

Keywords