Biotechnology for Biofuels (May 2018)

Complete genome sequence and the expression pattern of plasmids of the model ethanologen Zymomonas mobilis ZM4 and its xylose-utilizing derivatives 8b and 2032

  • Shihui Yang,
  • Jessica M. Vera,
  • Jeff Grass,
  • Giannis Savvakis,
  • Oleg V. Moskvin,
  • Yongfu Yang,
  • Sean J. McIlwain,
  • Yucai Lyu,
  • Irene Zinonos,
  • Alexander S. Hebert,
  • Joshua J. Coon,
  • Donna M. Bates,
  • Trey K. Sato,
  • Steven D. Brown,
  • Michael E. Himmel,
  • Min Zhang,
  • Robert Landick,
  • Katherine M. Pappas,
  • Yaoping Zhang

DOI
https://doi.org/10.1186/s13068-018-1116-x
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 20

Abstract

Read online

Abstract Background Zymomonas mobilis is a natural ethanologen being developed and deployed as an industrial biofuel producer. To date, eight Z. mobilis strains have been completely sequenced and found to contain 2–8 native plasmids. However, systematic verification of predicted Z. mobilis plasmid genes and their contribution to cell fitness has not been hitherto addressed. Moreover, the precise number and identities of plasmids in Z. mobilis model strain ZM4 have been unclear. The lack of functional information about plasmid genes in ZM4 impedes ongoing studies for this model biofuel-producing strain. Results In this study, we determined the complete chromosome and plasmid sequences of ZM4 and its engineered xylose-utilizing derivatives 2032 and 8b. Compared to previously published and revised ZM4 chromosome sequences, the ZM4 chromosome sequence reported here contains 65 nucleotide sequence variations as well as a 2400-bp insertion. Four plasmids were identified in all three strains, with 150 plasmid genes predicted in strain ZM4 and 2032, and 153 plasmid genes predicted in strain 8b due to the insertion of heterologous DNA for expanded substrate utilization. Plasmid genes were then annotated using Blast2GO, InterProScan, and systems biology data analyses, and most genes were found to have apparent orthologs in other organisms or identifiable conserved domains. To verify plasmid gene prediction, RNA-Seq was used to map transcripts and also compare relative gene expression under various growth conditions, including anaerobic and aerobic conditions, or growth in different concentrations of biomass hydrolysates. Overall, plasmid genes were more responsive to varying hydrolysate concentrations than to oxygen availability. Additionally, our results indicated that although all plasmids were present in low copy number (about 1–2 per cell), the copy number of some plasmids varied under specific growth conditions or due to heterologous gene insertion. Conclusions The complete genome of ZM4 and two xylose-utilizing derivatives is reported in this study, with an emphasis on identifying and characterizing plasmid genes. Plasmid gene annotation, validation, expression levels at growth conditions of interest, and contribution to host fitness are reported for the first time.

Keywords