BMC Genomics (Jan 2022)

Combined assembly of long and short sequencing reads improve the efficiency of exploring the soil metagenome

  • Guoshun Xu,
  • Liwen Zhang,
  • Xiaoqing Liu,
  • Feifei Guan,
  • Yuquan Xu,
  • Haitao Yue,
  • Jin-Qun Huang,
  • Jieyin Chen,
  • Ningfeng Wu,
  • Jian Tian

DOI
https://doi.org/10.1186/s12864-021-08260-3
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 15

Abstract

Background: Advances in DNA sequencing technologies have transformed our capacity to perform life science research, decipher the dynamics of complex soil microbial communities, and exploit them for plant disease management. However, soil is a complex conglomerate, which makes functional metagenomics studies very challenging.

Results: Metagenomes assembled from long-read (PacBio, PB), short-read (Illumina, IL), and combined PB and IL (PI) sequencing of soil DNA samples were compared. Ortholog analyses and functional annotation revealed that the PI approach significantly increased the contig length of the metagenomic sequences compared to IL and enlarged the gene pool compared to PB. The PI approach also offered comparable or higher species abundance than either PB or IL alone, and showed significant advantages for studying natural product biosynthetic genes in soil microbiomes.

Conclusion: Our results provide an effective strategy for combining long- and short-read DNA sequencing data to explore soil metagenomes and extract the maximum information from them.

Keywords