BMC Genomics (Jul 2011)

Gene discovery by genome-wide CDS re-prediction and microarray-based transcriptional analysis in phytopathogen Xanthomonas campestris

  • Pühler Alfred,
  • Xu Yuquan,
  • Tang Ji-Liang,
  • He Yong-Qiang,
  • Jiang Bo-Le,
  • Vorhölter Frank-Jörg,
  • Zhou Lian,
  • He Ya-Wen

DOI
https://doi.org/10.1186/1471-2164-12-359
Journal volume & issue
Vol. 12, no. 1
p. 359

Abstract

Background: One of the major tasks of the post-genomic era is "reading" genomic sequences in order to extract all the biological information contained in them. Although a wide variety of techniques are used to solve the gene-finding problem and a number of prokaryotic gene-finding programs are available, gene recognition in bacteria is far from always straightforward.

Results: This study reports a thorough search for new CDSs in the two published Xcc genomes. First, putative CDSs encoded in the two genomes were re-predicted using three gene finders, resulting in the identification of 2850 putative new CDSs. Second, similarity searching was conducted and 278 CDSs were found to have homologs in other bacterial species. Third, oligonucleotide microarray and RT-PCR analysis identified 147 CDSs with detectable mRNA transcripts. Finally, in-frame deletion and subsequent phenotype analysis confirmed that Xcc_CDS002, encoding a novel SIR2-like domain protein, is involved in virulence and that Xcc_CDS1553, encoding an ArsR family transcription factor, is involved in arsenate resistance.

Conclusions: Despite the sophisticated approaches available for genome annotation, many cellular transcripts have so far remained unidentified in the Xcc genomes. Through a combined strategy involving bioinformatic, postgenomic and genetic approaches, a reliable list of 306 new CDSs was identified and a more thorough understanding of some cellular processes was gained.

Keywords