PLoS ONE (Jan 2012)

Exploring sequence characteristics related to high-level production of secreted proteins in Aspergillus niger.

  • Bastiaan A van den Berg,
  • Marcel J T Reinders,
  • Marc Hulsman,
  • Liang Wu,
  • Herman J Pel,
  • Johannes A Roubos,
  • Dick de Ridder

DOI
https://doi.org/10.1371/journal.pone.0045869
Journal volume & issue
Vol. 7, no. 10
p. e45869

Abstract

Read online

Protein sequence features are explored in relation to the production of over-expressed extracellular proteins by fungi. Knowledge on features influencing protein production and secretion could be employed to improve enzyme production levels in industrial bioprocesses via protein engineering. A large set, over 600 homologous and nearly 2,000 heterologous fungal genes, were overexpressed in Aspergillus niger using a standardized expression cassette and scored for high versus no production. Subsequently, sequence-based machine learning techniques were applied for identifying relevant DNA and protein sequence features. The amino-acid composition of the protein sequence was found to be most predictive and interpretation revealed that, for both homologous and heterologous gene expression, the same features are important: tyrosine and asparagine composition was found to have a positive correlation with high-level production, whereas for unsuccessful production, contributions were found for methionine and lysine composition. The predictor is available online at http://bioinformatics.tudelft.nl/hipsec. Subsequent work aims at validating these findings by protein engineering as a method for increasing expression levels per gene copy.