PLoS ONE (Dec 2009)

Proteomic detection of non-annotated protein-coding genes in Pseudomonas fluorescens Pf0-1.

  • Wook Kim,
  • Mark W Silby,
  • Sam O Purvine,
  • Julie S Nicoll,
  • Kim K Hixson,
  • Matt Monroe,
  • Carrie D Nicora,
  • Mary S Lipton,
  • Stuart B Levy

DOI
https://doi.org/10.1371/journal.pone.0008455
Journal volume & issue
Vol. 4, no. 12
p. e8455

Abstract

Read online

Genome sequences are annotated by computational prediction of coding sequences, followed by similarity searches such as BLAST, which provide a layer of possible functional information. While the existence of processes such as alternative splicing complicates matters for eukaryote genomes, the view of bacterial genomes as a linear series of closely spaced genes leads to the assumption that computational annotations that predict such arrangements completely describe the coding capacity of bacterial genomes. We undertook a proteomic study to identify proteins expressed by Pseudomonas fluorescens Pf0-1 from genes that were not predicted during the genome annotation. Mapping peptides to the Pf0-1 genome sequence identified sixteen non-annotated protein-coding regions, of which nine were antisense to predicted genes, six were intergenic, and one read in the same direction as an annotated gene but in a different frame. The expression of all but one of the newly discovered genes was verified by RT-PCR. Few clues as to the function of the new genes were gleaned from informatic analyses, but potential orthologs in other Pseudomonas genomes were identified for eight of the new genes. The 16 newly identified genes improve the quality of the Pf0-1 genome annotation, and the detection of antisense protein-coding genes indicates the under-appreciated complexity of bacterial genome organization.