Scientific Data (Jul 2024)
Chromosome-level genome assembly of Plagiognathops microlepis based on PacBio HiFi and Hi-C sequencing
Abstract
Abstract Plagiognathops microlepis is an economic freshwater fish in the subfamily Xenocyprinae of Cyprinidae. It is widely distributed in the freshwater ecosystem of China, with moderate economic value and broad development prospects. However, the lack of genomic resources has limited our understanding on the genetic basis, phylogenetic status and adaptive evolution strategies of this fish. Here, we assembled a chromosome-level reference genome of P. microlepis by integrating Pacbio HiFi long-reads, Illumina short-reads and Hi-C sequencing data. The size of this genome is 1004.34 Mb with a contig N50 of 38.80 Mb. Using Hi-C sequencing data, 99.59% of the assembled sequences were further anchored to 24 chromosomes. A total of 578.91 Mb repeat sequences and 28,337 protein-coding genes were predicted in the current genome, of which, 26,929 genes were functionally annotated. This genome provides valuable information for investigating the phylogeny and evolutionary history of cyprinid fishes, as well as the genetic basis of adaptive strategies and special traits in P. microlepis.