BMC Bioinformatics (Dec 2020)

An automated protocol for modelling peptide substrates to proteases

  • Rodrigo Ochoa,
  • Mikhail Magnitov,
  • Roman A. Laskowski,
  • Pilar Cossio,
  • Janet M. Thornton

DOI
https://doi.org/10.1186/s12859-020-03931-6
Journal volume & issue
Vol. 21, no. 1
pp. 1 – 20

Abstract

Read online

Abstract Background Proteases are key drivers in many biological processes, in part due to their specificity towards their substrates. However, depending on the family and molecular function, they can also display substrate promiscuity which can also be essential. Databases compiling specificity matrices derived from experimental assays have provided valuable insights into protease substrate recognition. Despite this, there are still gaps in our knowledge of the structural determinants. Here, we compile a set of protease crystal structures with bound peptide-like ligands to create a protocol for modelling substrates bound to protease structures, and for studying observables associated to the binding recognition. Results As an application, we modelled a subset of protease–peptide complexes for which experimental cleavage data are available to compare with informational entropies obtained from protease–specificity matrices. The modelled complexes were subjected to conformational sampling using the Backrub method in Rosetta, and multiple observables from the simulations were calculated and compared per peptide position. We found that some of the calculated structural observables, such as the relative accessible surface area and the interaction energy, can help characterize a protease’s substrate recognition, giving insights for the potential prediction of novel substrates by combining additional approaches. Conclusion Overall, our approach provides a repository of protease structures with annotated data, and an open source computational protocol to reproduce the modelling and dynamic analysis of the protease–peptide complexes.

Keywords