Scientific Data (May 2023)

Allele-aware chromosome-level genome assembly of the autohexaploid Diospyros kaki Thunb

  • Huawei Li,
  • Peng Sun,
  • Yiru Wang,
  • Zhongren Zhang,
  • Jun Yang,
  • Yujing Suo,
  • Weijuan Han,
  • Songfeng Diao,
  • Fangdong Li,
  • Jianmin Fu

DOI
https://doi.org/10.1038/s41597-023-02175-2
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Artificially improving persimmon (Diospyros kaki Thunb.), one of the most important fruit trees, remains challenging owing to the lack of reference genomes. In this study, we generated an allele-aware chromosome-level genome assembly for the autohexaploid persimmon ‘Xiaoguotianshi’ (Chinese-PCNA type) using PacBio CCS and Hi-C technology. The final assembly contained 4.52 Gb, with a contig N50 value of 5.28 Mb and scaffold N50 value of 44.01 Mb, of which 4.06 Gb (89.87%) of the assembly were anchored onto 90 chromosome-level pseudomolecules comprising 15 homologous groups with 6 allelic chromosomes in each. A total of 153,288 protein-coding genes were predicted, of which 98.60% were functionally annotated. Repetitive sequences accounted for 64.02% of the genome; and 110,480 rRNAs, 12,297 tRNAs, 1,483 miRNAs, and 3,510 snRNA genes were also identified. This genome assembly fills the knowledge gap in the autohexaploid persimmon genome, which is conducive in the study on the regulatory mechanisms underlying the major economically advantageous traits of persimmons and promoting breeding programs.