CalcAMP: A New Machine Learning Model for the Accurate Prediction of Antimicrobial Activity of Peptides
Colin Bournez,
Martijn Riool,
Leonie de Boer,
Robert A. Cordfunke,
Leonie de Best,
Remko van Leeuwen,
Jan Wouter Drijfhout,
Sebastian A. J. Zaat,
Gerard J. P. van Westen
Affiliations
Colin Bournez
Computational Drug Discovery, Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, P.O. Box 9502, 2300 RA Leiden, The Netherlands
Martijn Riool
Department of Medical Microbiology and Infection Prevention, Amsterdam Institute for Infection and Immunity, Amsterdam UMC, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
Leonie de Boer
Department of Medical Microbiology and Infection Prevention, Amsterdam Institute for Infection and Immunity, Amsterdam UMC, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
Robert A. Cordfunke
Department Immunology, Leiden University Medical Center, 2300 RC Leiden, The Netherlands
Leonie de Best
Madam Therapeutics B.V., Pivot Park Life Sciences Community, Kloosterstraat 9, 5349 AB Oss, The Netherlands
Remko van Leeuwen
Madam Therapeutics B.V., Pivot Park Life Sciences Community, Kloosterstraat 9, 5349 AB Oss, The Netherlands
Jan Wouter Drijfhout
Department Immunology, Leiden University Medical Center, 2300 RC Leiden, The Netherlands
Sebastian A. J. Zaat
Department of Medical Microbiology and Infection Prevention, Amsterdam Institute for Infection and Immunity, Amsterdam UMC, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
Gerard J. P. van Westen
Computational Drug Discovery, Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, P.O. Box 9502, 2300 RA Leiden, The Netherlands
To combat infection by microorganisms host organisms possess a primary arsenal via the innate immune system. Among them are defense peptides with the ability to target a wide range of pathogenic organisms, including bacteria, viruses, parasites, and fungi. Here, we present the development of a novel machine learning model capable of predicting the activity of antimicrobial peptides (AMPs), CalcAMP. AMPs, in particular short ones (<35 amino acids), can become an effective solution to face the multi-drug resistance issue arising worldwide. Whereas finding potent AMPs through classical wet-lab techniques is still a long and expensive process, a machine learning model can be useful to help researchers to rapidly identify whether peptides present potential or not. Our prediction model is based on a new data set constructed from the available public data on AMPs and experimental antimicrobial activities. CalcAMP can predict activity against both Gram-positive and Gram-negative bacteria. Different features either concerning general physicochemical properties or sequence composition have been assessed to retrieve higher prediction accuracy. CalcAMP can be used as an promising prediction asset to identify short AMPs among given peptide sequences.