Pharmaceutics (Jul 2025)
Predicting Antimicrobial Peptide Activity: A Machine Learning-Based Quantitative Structure–Activity Relationship Approach
Abstract
Background: Peptides are a class of molecules that can be presented as good antimicrobials and with mechanisms that avoid resistance, and the design of peptides with good activity can be complex and laborious. The study of their quantitative structure–activity relationships through machine learning algorithms can shed light on a rational and effective design. Methods: Information on the antimicrobial activity of peptides was collected, and their structures were characterized by molecular descriptors generation to design regression and classification models based on machine learning algorithms. The contribution of each descriptor in the generated models was evaluated by determining its relative importance and, finally, the antimicrobial activity of new peptides was estimated. Results: A structured database of antimicrobial peptides and their descriptors was obtained, with which 56 machine learning models were generated. Random Forest-based models showed better performance, and of these, regression models showed variable performance (R2 = 0.339–0.574), while classification models showed good performance (MCC = 0.662–0.755 and ACC = 0.831–0.877). Those models based on bacterial groups showed better performance than those based on the entire dataset. The properties of the new peptides generated are related to important descriptors that encode physicochemical properties such as lower molecular weight, higher charge, propensity to form alpha-helical structures, lower hydrophobicity, and higher frequency of amino acids such as lysine and serine. Conclusions: Machine learning models allowed to establish the structure–activity relationships of antimicrobial peptides. Classification models performed better than regression models. These models allowed us to make predictions and new peptides with high antimicrobial potential were proposed.
Keywords