PLoS ONE (Jan 2014)

High precision prediction of functional sites in protein structures.

  • Ljubomir Buturovic,
  • Mike Wong,
  • Grace W Tang,
  • Russ B Altman,
  • Dragutin Petkovic

DOI
https://doi.org/10.1371/journal.pone.0091240
Journal volume & issue
Vol. 9, no. 3
p. e91240

Abstract

Read online

We address the problem of assigning biological function to solved protein structures. Computational tools play a critical role in identifying potential active sites and informing screening decisions for further lab analysis. A critical parameter in the practical application of computational methods is the precision, or positive predictive value. Precision measures the level of confidence the user should have in a particular computed functional assignment. Low precision annotations lead to futile laboratory investigations and waste scarce research resources. In this paper we describe an advanced version of the protein function annotation system FEATURE, which achieved 99% precision and average recall of 95% across 20 representative functional sites. The system uses a Support Vector Machine classifier operating on the microenvironment of physicochemical features around an amino acid. We also compared performance of our method with state-of-the-art sequence-level annotator Pfam in terms of precision, recall and localization. To our knowledge, no other functional site annotator has been rigorously evaluated against these key criteria. The software and predictive models are incorporated into the WebFEATURE service at http://feature.stanford.edu/wf4.0-beta.