Scientific Data (Aug 2024)

Targeted DNA-seq and RNA-seq of Reference Samples with Short-read and Long-read Sequencing

  • Binsheng Gong,
  • Dan Li,
  • Paweł P. Łabaj,
  • Bohu Pan,
  • Natalia Novoradovskaya,
  • Danielle Thierry-Mieg,
  • Jean Thierry-Mieg,
  • Guangchun Chen,
  • Anne Bergstrom Lucas,
  • Jennifer S. LoCoco,
  • Todd A. Richmond,
  • Elizabeth Tseng,
  • Rebecca Kusko,
  • Scott Happe,
  • Timothy R. Mercer,
  • Carlos Pabón-Peña,
  • Michael Salmans,
  • Hagen U. Tilgner,
  • Wenzhong Xiao,
  • Donald J. Johann,
  • Wendell Jones,
  • Weida Tong,
  • Christopher E. Mason,
  • David P. Kreil,
  • Joshua Xu

DOI
https://doi.org/10.1038/s41597-024-03741-y
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Next-generation sequencing (NGS) has revolutionized genomic research by enabling high-throughput, cost-effective genome and transcriptome sequencing accelerating personalized medicine for complex diseases, including cancer. Whole genome/transcriptome sequencing (WGS/WTS) provides comprehensive insights, while targeted sequencing is more cost-effective and sensitive. In comparison to short-read sequencing, which still dominates the field due to high speed and cost-effectiveness, long-read sequencing can overcome alignment limitations and better discriminate similar sequences from alternative transcripts or repetitive regions. Hybrid sequencing combines the best strengths of different technologies for a more comprehensive view of genomic/transcriptomic variations. Understanding each technology’s strengths and limitations is critical for translating cutting-edge technologies into clinical applications. In this study, we sequenced DNA and RNA libraries of reference samples using various targeted DNA and RNA panels and the whole transcriptome on both short-read and long-read platforms. This study design enables a comprehensive analysis of sequencing technologies, targeting protocols, and library preparation methods. Our expanded profiling landscape establishes a reference point for assessing current sequencing technologies, facilitating informed decision-making in genomic research and precision medicine.