BMC Genomics (Sep 2021)

Complete genome sequence and identification of polyunsaturated fatty acid biosynthesis genes of the myxobacterium Minicystis rosea DSM 24000T

  • Shilpee Pal,
  • Gaurav Sharma,
  • Srikrishna Subramanian

DOI
https://doi.org/10.1186/s12864-021-07955-x
Journal volume & issue
Vol. 22, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Background Myxobacteria harbor numerous biosynthetic gene clusters that can produce a diverse range of secondary metabolites. Minicystis rosea DSM 24000T is a soil-dwelling myxobacterium belonging to the suborderSorangiineae and family Polyangiaceae and is known to produce various secondary metabolites as well as polyunsaturated fatty acids (PUFAs). Here, we use whole-genome sequencing to explore the diversity of biosynthetic gene clusters in M. rosea. Results Using PacBio sequencing technology, we assembled the 16.04 Mbp complete genome of M. rosea DSM 24000T, the largest bacterial genome sequenced to date. About 44% of its coding potential represents paralogous genes predominantly associated with signal transduction, transcriptional regulation, and protein folding. These genes are involved in various essential functions such as cellular organization, diverse niche adaptation, and bacterial cooperation, and enable social behavior like gliding motility, sporulation, and predation, typical of myxobacteria. A profusion of eukaryotic-like kinases (353) and an elevated ratio of phosphatases (8.2/1) in M. rosea as compared to other myxobacteria suggest gene duplication as one of the primary modes of genome expansion. About 7.7% of the genes are involved in the biosynthesis of a diverse array of secondary metabolites such as polyketides, terpenes, and bacteriocins. Phylogeny of the genes involved in PUFA biosynthesis (pfa) together with the conserved synteny of the complete pfa gene cluster suggests acquisition via horizontal gene transfer from Actinobacteria. Conclusion Overall, this study describes the complete genome sequence of M. rosea, comparative genomic analysis to explore the putative reasons for its large genome size, and explores the secondary metabolite potential, including the biosynthesis of polyunsaturated fatty acids.

Keywords