Progress in Fishery Sciences (Jun 2024)
Screening Muscle Growth-Related Genes of Coregonus ussurinsis Berg Based on Transcriptome Sequencing
Abstract
Coregonus ussurinsis Berg is a rare cold-water fish found in Heilongjiang Province, which has high nutritional and economical value. The growth traits of fish are critical breeding target traits, and improving the growth efficiency of cultured fishes has always been a major issue for researchers. As an endangered fish, very limited research has been conducted on C. ussurinsis, and studies on its growth and development are still lacking. Therefore, investigating gene expression in C. ussurinsis muscles would significantly contribute to our understanding of their muscle development. RNA-Seq was used to find and study the specific genes and pathways of muscle development under different conditions. Recently, transcriptome sequencing has been applied to diverse animal populations, aiding in the selection of candidate genes related to important traits by comparing the global gene expression profiles between different animal populations with specific characteristics. This study aims to understand the genetic basis of muscle development in C. ussurinsis at the transcriptome level and to provide new insights into growth and development. To explore the molecular regulation mechanism of growth traits of C. ussurinsis, F2 individuals of C. ussurinsis were randomly selected from the mixed pool for test grouping (fast-growing group and slow-growing group). The dorsal muscle tissue was clipped from 10 fast-growing individuals (219.20±38.66 g, weight) and 10 slow-growing individuals (74.30±17.86 g, weight) for transcriptome sequencing to construct six cDNA libraries. High-throughput sequencing from Illumina NovaSeq 6000 and bioinformatics was used to determine the abundances and characteristics of transcripts. The differentially expressed genes were screened with FDR (false discovery rate) 1; the functions of these differentially expressed genes (DEGs) were annotated and analyzed by the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database to identify the genes and genetic pathways related to the development of muscle in C. ussurinsis. Moreover, to verify the sequencing results, real-time fluorescence quantitative PCR (qRT-PCR) was used to detect the expression levels of DEGs. The results showed that the correlation coefficients of all the samples used for transcriptome sequencing were above 0.83, indicating high correlation between the samples and experimental reliability. Transcriptome sequencing results showed that a total of 295 605 738 raw reads were assembled from the six cDNA libraries, and 283 133 612 clean reads were obtained after quality control. Q20 and Q30 sequences accounted for above 97.80% and 93.90%, respectively, and the content of GC bases accounted for more than 49.1% of the total bases. Through comparison with the genome using EdgeR software, which was used to analyze the differences in gene expression, 2 211 DEGs were preliminarily obtained from muscle, including 659 novel genes. Compared with the slow-growing group, 583 differential genes were up-regulated, and 1 628 differential genes were down-regulated in the fast-growing group. Function enrichment analysis found that the DEGs participated in 3 620 GO terms. Among them, 2 457 biological processes were primarily involved in cellular and metabolic processes; there were 782 molecular functions, primarily involved in binding function and catalytic activity processes, and 381 cellular components, primarily involved in cell and cell component processes. The enrichment analysis of the KEGG pathway found that a total of 251 signal pathways were obtained, among which 73 were significantly enriched (P < 0.05). Among them, the up-regulated DEGs were mainly involved in glycolysis/gluconeogenesis, methane metabolism, and biosynthesis of amino acids, while the down-regulated DEGs were mainly involved in osteoclast differentiation, IL-17 signaling pathway, and biosynthesis of amino acids. The genes related to muscle growth were significantly (P < 0.05) enriched in the MAPK signaling pathway, PI3K-Akt signaling pathway, tight junction, insulin signaling pathway, glycolysis/gluconeogenesis, and PPAR signaling pathway. These pathways might be closely related to muscle growth. Combined with the GO functional annotation, the KEGG pathway enrichment, and the annotation results, 31 potential growth-related candidate genes were preliminarily screened. Protein-protein interaction networks were used to further analyze the relationship between these differential genes. It was found that atp2a2, atp2a1, g6pc, igfbp1, myh1, myh4, myh6, myh7, myh9, and myh13 might be closely related to muscle growth regulation, and these 10 genes can be used as crucial candidate genes for the growth regulation of C. ussurinsis. The qRT-PCR validation of 10 randomly selected differential genes showed consistent gene expression trends with the transcriptome sequencing results, which indicated that the results obtained by transcriptome sequencing in this study were accurate. A total of 10 growth-related essential candidate genes were screened in this study; these genes affect the growth of C. ussurinsis by regulating their expression levels in muscle tissue. These results provide vital information for the further understanding of the molecular basis and marker-assisted breeding of the growth regulation of C. ussurinsis.
Keywords