Scientific Reports (Jul 2023)

Exploring biomarkers of premature ovarian insufficiency based on Oxford Nanopore transcriptional profile and machine learning

  • Zhaoyang Yu
  • Mujun Li
  • Weilong Peng

DOI
https://doi.org/10.1038/s41598-023-38754-x
Journal volume & issue
Vol. 13, no. 1
pp. 1 – 10

Abstract

Premature ovarian insufficiency (POI) is a reproductive endocrine disorder characterized by infertility and perimenopausal syndrome, with a highly heterogeneous genetic etiology and a mechanism that is not fully understood. Therefore, we used Oxford Nanopore Technology (ONT) for the first time to characterize the full-length transcript profile of POI and to identify biomarkers, pathways, and molecular mechanisms through bioinformatics analysis and machine learning. We identified 272 differentially expressed genes, 858 core genes, and 25 hub genes by differential expression analysis, gene set enrichment analysis (GSEA), and protein–protein interaction analysis, respectively. Seven candidate genes were identified as the intersection of the features selected by the random forest and Boruta algorithms. qRT-PCR results indicated that COX5A, UQCRFS1, LCK, RPS2 and EIF5A showed expression trends consistent with the sequencing data and have potential as biomarkers. Additionally, GSEA revealed that the pathophysiology of POI is closely associated with inhibition of the PI3K-AKT pathway, oxidative phosphorylation, and DNA damage repair, as well as activation of inflammatory and apoptotic pathways. Furthermore, we emphasize that downregulation of respiratory chain enzyme complex subunits and inhibition of oxidative phosphorylation pathways play crucial roles in the pathophysiology of POI. In conclusion, long-read sequencing has refined the annotation of the POI transcriptional profile. These data provide novel insights for further exploration of the molecular regulatory networks and potential biomarkers associated with POI.