International Journal of Molecular Sciences (Jul 2023)

A Random Forest Model for Peptide Classification Based on Virtual Docking Data

  • Hua Feng,
  • Fangyu Wang,
  • Ning Li,
  • Qian Xu,
  • Guanming Zheng,
  • Xuefeng Sun,
  • Man Hu,
  • Guangxu Xing,
  • Gaiping Zhang

DOI
https://doi.org/10.3390/ijms241411409
Journal volume & issue
Vol. 24, no. 14
p. 11409

Abstract

Read online

The affinity of peptides is a crucial factor in studying peptide–protein interactions. Despite the development of various techniques to evaluate peptide–receptor affinity, the results may not always reflect the actual affinity of the peptides accurately. The current study provides a free tool to assess the actual peptide affinity based on virtual docking data. This study employed a dataset that combined actual peptide affinity information (active and inactive) and virtual peptide–receptor docking data, and different machine learning algorithms were utilized. Compared with the other algorithms, the random forest (RF) algorithm showed the best performance and was used in building three RF models using different numbers of significant features (four, three, and two). Further analysis revealed that the four-feature RF model achieved the highest Accuracy of 0.714 in classifying an independent unknown peptide dataset designed with the PEDV spike protein, and it also revealed overfitting problems in the other models. This four-feature RF model was used to evaluate peptide affinity by constructing the relationship between the actual affinity and the virtual docking scores of peptides to their receptors.

Keywords