Scientific Data (Jul 2024)

Telomere-to-telomere genome assembly of the goose Anser cygnoides

  • Hongchang Zhao,
  • Hao Zhou,
  • Guobo Sun,
  • Biao Dong,
  • Wenqi Zhu,
  • Xiaohui Mu,
  • Xiaoming Li,
  • Jun Wang,
  • Mengli Zhao,
  • Wenhao Yang,
  • Gansheng Zhang,
  • Rongchao Ji,
  • Tuoyu Geng,
  • Daoqing Gong,
  • He Meng,
  • Jian Wang

DOI
https://doi.org/10.1038/s41597-024-03567-8
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 9

Abstract

Read online

Abstract Our study presents the assembly of a high-quality Taihu goose genome at the Telomere-to-Telomere (T2T) level. By employing advanced sequencing technologies, including Pacific Biosciences HiFi reads, Oxford Nanopore long reads, Illumina short reads, and chromatin conformation capture (Hi-C), we achieved an exceptional assembly. The T2T assembly encompasses a total length of 1,197,991,206 bp, with contigs N50 reaching 33,928,929 bp and scaffold N50 attaining 81,007,908 bp. It consists of 73 scaffolds, including 38 autosomes and one pair of Z/W sex chromosomes. Importantly, 33 autosomes were assembled without any gap, resulting in a contiguous representation. Furthermore, gene annotation efforts identified 34,898 genes, including 436,162 RNA transcripts, encompassing 806,158 exons, 743,910 introns, 651,148 coding sequences (CDS), and 135,622 untranslated regions (UTR). The T2T-level chromosome-scale goose genome assembly provides a vital foundation for future genetic improvement and understanding the genetic mechanisms underlying important traits in geese.