Integrated structure-based protein interface prediction

M. Walder; E. Edelstein; M. Carroll; S. Lazarev; J. E. Fajardo; A. Fiser; R. Viswanathan

doi:10.1186/s12859-022-04852-2

BMC Bioinformatics (Jul 2022)

Integrated structure-based protein interface prediction

M. Walder,
E. Edelstein,
M. Carroll,
S. Lazarev,
J. E. Fajardo,
A. Fiser,
R. Viswanathan

Affiliations

M. Walder: Department of Chemistry, Yeshiva College, Yeshiva University
E. Edelstein: Department of Chemistry, Yeshiva College, Yeshiva University
M. Carroll: Department of Chemistry, Yeshiva College, Yeshiva University
S. Lazarev: Department of Chemistry, Yeshiva College, Yeshiva University
J. E. Fajardo: Department of Systems and Computational Biology, Albert Einstein College of Medicine
A. Fiser: Department of Systems and Computational Biology, Albert Einstein College of Medicine
R. Viswanathan: Department of Chemistry, Yeshiva College, Yeshiva University

DOI: https://doi.org/10.1186/s12859-022-04852-2
Journal volume & issue: Vol. 23, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Background Identifying protein interfaces can inform how proteins interact with their binding partners, uncover the regulatory mechanisms that control biological functions and guide the development of novel therapeutic agents. A variety of computational approaches have been developed for predicting a protein’s interfacial residues from its known sequence and structure. Methods using the known three-dimensional structures of proteins can be template-based or template-free. Template-based methods have limited success in predicting interfaces when homologues with known complex structures are not available to use as templates. The prediction performance of template-free methods that only rely only upon proteins’ intrinsic properties is limited by the amount of biologically relevant features that can be included in an interface prediction model. Results We describe the development of an integrated method for protein interface prediction (ISPIP) to explore the hypothesis that the efficacy of a computational prediction method of protein binding sites can be enhanced by using a combination of methods that rely on orthogonal structure-based properties of a query protein, combining and balancing both template-free and template-based features. ISPIP is a method that integrates these approaches through simple linear or logistic regression models and more complex decision tree models. On a diverse test set of 156 query proteins, ISPIP outperforms each of its individual classifiers in identifying protein binding interfaces. Conclusions The integrated method captures the best performance of individual classifiers and delivers an improved interface prediction. The method is robust and performs well even when one of the individual classifiers performs poorly on a particular query protein. This work demonstrates that integrating orthogonal methods that depend on different structural properties of proteins performs better at interface prediction than any individual classifier alone.

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/

About the journal

Abstract

Keywords