Scientific Data (Feb 2025)
Chromosome-level genome assembly of flathead asp (Pseudaspius leptocephalus)
Abstract
Abstract Pseudaspius leptocephalus is an economically and ecologically important fish species endemic to the Heilongjiang River Basin. Sequencing of the complete genome of P. leptocephalus could provide new insights with implications for the breeding and conservation of this species. The aim of this study was to obtain the chromosome-level genome of P. leptocephalus using PacBio and Hi-C sequencing. PacBio sequencing yielded an assembly of 889.21 Mb, with a contig N50 size of 10.9 Mb. Hi-C sequencing data were used for chromosome mapping and ultimately yielded 25 chromosome sequences. The success rate of chromosome mapping was 95.0%, with a BUSCO evaluation integrity score of 97.3%. A total of 25,531 protein-coding genes were predicted and 24,787 genes were functionally annotated. The BUSCO evaluation integrity score for the predicted protein-coding genes was 93.7%. Repeat sequences accounted for 39.5% of the total length of the genome. This work could aid the artificial breeding of P. leptocephalus, promote the conservation of biological resources, and provide a basis for future genomic and evolutionary studies of this species.