Nature Communications (Jul 2024)

Low-input PacBio sequencing generates high-quality individual fly genomes and characterizes mutational processes

  • Hangxing Jia,
  • Shengjun Tan,
  • Yingao Cai,
  • Yanyan Guo,
  • Jieyu Shen,
  • Yaqiong Zhang,
  • Huijing Ma,
  • Qingzhu Zhang,
  • Jinfeng Chen,
  • Gexia Qiao,
  • Jue Ruan,
  • Yong E. Zhang

DOI
https://doi.org/10.1038/s41467-024-49992-6
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 19

Abstract

Read online

Abstract Long-read sequencing, exemplified by PacBio, revolutionizes genomics, overcoming challenges like repetitive sequences. However, the high DNA requirement ( > 1 µg) is prohibitive for small organisms. We develop a low-input (100 ng), low-cost, and amplification-free library-generation method for PacBio sequencing (LILAP) using Tn5-based tagmentation and DNA circularization within one tube. We test LILAP with two Drosophila melanogaster individuals, and generate near-complete genomes, surpassing preexisting single-fly genomes. By analyzing variations in these two genomes, we characterize mutational processes: complex transpositions (transposon insertions together with extra duplications and/or deletions) prefer regions characterized by non-B DNA structures, and gene conversion of transposons occurs on both DNA and RNA levels. Concurrently, we generate two complete assemblies for the endosymbiotic bacterium Wolbachia in these flies and similarly detect transposon conversion. Thus, LILAP promises a broad PacBio sequencing adoption for not only mutational studies of flies and their symbionts but also explorations of other small organisms or precious samples.