iScience (Jun 2024)

PredCoffee: A binary classification approach specifically for coffee odor

  • Yi He,
  • Ruirui Huang,
  • Ruoyu Zhang,
  • Fei He,
  • Lu Han,
  • Weiwei Han

Journal volume & issue
Vol. 27, no. 6
p. 110041

Abstract

Read online

Summary: Compared to traditional methods, using machine learning to assess or predict the odor of molecules can save costs in various aspects. Our research aims to collect molecules with coffee odor and summarize the regularity of these molecules, ultimately creating a binary classifier that can determine whether a molecule has a coffee odor. In this study, a total of 371 coffee-odor molecules and 9,700 non-coffee-odor molecules were collected. The Knowledge-guided Pre-training of Graph Transformer (KPGT), support vector machine (SVM), random forest (RF), multi-layer perceptron (MLP), and message-passing neural networks (MPNN) were used to train the data. The model with the best performance was selected as the basis of the predictor. The prediction accuracy value of the KPGT model exceeded 0.84 and the predictor has been deployed as a webserver PredCoffee.

Keywords