PLoS Biology (Apr 2007)

Superfamily assignments for the yeast proteome through integration of structure prediction with the gene ontology.

  • Lars Malmström,
  • Michael Riffle,
  • Charlie E M Strauss,
  • Dylan Chivian,
  • Trisha N Davis,
  • Richard Bonneau,
  • David Baker

DOI
https://doi.org/10.1371/journal.pbio.0050076
Journal volume & issue
Vol. 5, no. 4
p. e76

Abstract

Saccharomyces cerevisiae is one of the best-studied model organisms, yet the three-dimensional structure and molecular function of many yeast proteins remain unknown. Yeast proteins were parsed into 14,934 domains, and those lacking sequence similarity to proteins of known structure were folded using the Rosetta de novo structure prediction method on the World Community Grid. This structural data was integrated with process, component, and function annotations from the Saccharomyces Genome Database to assign yeast protein domains to SCOP superfamilies using a simple Bayesian approach. We have predicted the structure of 3,338 putative domains and assigned SCOP superfamily annotations to 581 of them. We have also assigned structural annotations to 7,094 predicted domains based on fold recognition and homology modeling methods. The domain predictions and structural information are available in an online database at http://rd.plos.org/10.1371_journal.pbio.0050076_01.
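
The abstract does not spell out the "simple Bayesian approach", so the following is only a minimal sketch of how structure-based evidence and Gene Ontology annotations could be combined under a naive-Bayes independence assumption. The function name, the superfamily labels (sf_A, sf_B), the GO terms, and every probability below are hypothetical placeholders, not values or methods taken from the paper.

```python
from math import prod


def superfamily_posterior(prior, structure_likelihood, go_likelihood, go_terms,
                          unseen_term_prob=1e-3):
    """Return P(superfamily | structure evidence, GO terms) for each superfamily.

    Assumption: the structure evidence and each GO term are treated as
    conditionally independent given the superfamily (naive Bayes).
    All tables passed in are illustrative, not data from the paper.
    """
    unnormalized = {}
    for sf, p_sf in prior.items():
        # Likelihood of the observed GO terms; terms with no entry for this
        # superfamily get a small fallback probability instead of zero.
        p_go = prod(go_likelihood[sf].get(term, unseen_term_prob)
                    for term in go_terms)
        unnormalized[sf] = p_sf * structure_likelihood[sf] * p_go
    total = sum(unnormalized.values())
    return {sf: value / total for sf, value in unnormalized.items()}


# Toy inputs (hypothetical): two candidate superfamilies for one domain.
prior = {"sf_A": 0.5, "sf_B": 0.5}
structure_likelihood = {"sf_A": 0.6, "sf_B": 0.4}  # e.g. from model-to-fold similarity
go_likelihood = {
    "sf_A": {"DNA binding": 0.8, "nucleus": 0.7},
    "sf_B": {"DNA binding": 0.1, "nucleus": 0.3},
}

posterior = superfamily_posterior(
    prior, structure_likelihood, go_likelihood, ["DNA binding", "nucleus"]
)
best = max(posterior, key=posterior.get)
print(best, posterior)
```

In this toy case the structure evidence alone only weakly favors sf_A, and the annotation terms sharpen the assignment considerably, which is the general idea behind integrating the two sources of evidence.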