Scientific Data (Sep 2024)

The first Chromosomal-level genome assembly of Sageretia thea using Nanopore long reads and Pore-C technology

  • Jihoon Jo,
  • Jong-Soo Park,
  • Hari Won,
  • Jun Seong Jeong,
  • Tae Won Jung,
  • Kyung Jun Lee,
  • Shin Ae Lee

DOI
https://doi.org/10.1038/s41597-024-03798-9
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Sageretia thea, a notable species within the mock buckthorn genus, is recognized for its intriguing biogeographical distribution and diverse medicinal properties. Despite this significance, genomic studies on S. thea are still in the nascent stages. We present the first chromosome-level genome assembly of S. thea that was generated using a combination of Oxford Nanopore long-read and Illumina short-read sequencing technologies complemented by Pore-C chromatin conformation capture. The genome assembly had a size of 197.8 Mb with 12 chromosomal scaffolds and a scaffold N50 length of 15.9 Mb. A total of 25,434 protein-coding genes were identified and functionally annotated, and the gene model indicated 96.5% complete eukaryotic BUSCOs. Additionally, orthologous gene profiling and synteny analysis were performed to elucidate the evolutionary relationships within the Rhamnaceae family and Rosales. This high-quality chromosomal genome is the first genomic view of S. thea, which will serve as the basis for future studies on its biological and medicinal properties, and evolutionary history.