Frontiers in Genetics (Jan 2025)

Identification of key genes affecting intramuscular fat deposition in pigs using machine learning models

  • Yumei Shi,
  • Yumei Shi,
  • Xini Wang,
  • Shaokang Chen,
  • Yanhui Zhao,
  • Yan Wang,
  • Xihui Sheng,
  • Xiaolong Qi,
  • Lei Zhou,
  • Yu Feng,
  • Jianfeng Liu,
  • Chuduan Wang,
  • Kai Xing

DOI
https://doi.org/10.3389/fgene.2024.1503148
Journal volume & issue
Vol. 15

Abstract

Read online

Intramuscular fat (IMF) is an important indicator for evaluating meat quality. Transcriptome sequencing (RNA-seq) is widely used for the study of IMF deposition. Machine learning (ML) is a new big data fitting method that can effectively fit complex data, accurately identify samples and genes, and it plays an important role in omics research. Therefore, this study aimed to analyze RNA-seq data by ML method to identify differentially expressed genes (DEGs) affecting IMF deposition in pigs. In this study, a total of 74 RNA-seq data from muscle tissue samples were used. A total of 155 DEGs were identified using a limma package between the two groups. 100 and 11 significant genes were identified by support vector machine recursive feature elimination (SVM-RFE) and random forest (RF) models, respectively. A total of six intersecting genes were in both models. KEGG pathway enrichment analysis of the intersecting genes revealed that these genes were enriched in pathways associated with lipid deposition. These pathways include α-linolenic acid metabolism, linoleic acid metabolism, ether lipid metabolism, arachidonic acid metabolism, and glycerophospholipid metabolism. Four key genes affecting intramuscular fat deposition, PLA2G6, MPV17, NUDT2, and ND4L, were identified based on significant pathways. The results of this study are important for the elucidation of the molecular regulatory mechanism of intramuscular fat deposition and the effective improvement of IMF content in pigs.

Keywords