Frontiers in Medicine (Feb 2024)

A multi-variable predictive warning model for cervical cancer using clinical and SNPs data

  • Xiangqin Li,
  • Xiangqin Li,
  • Ruoqi Ning,
  • Ruoqi Ning,
  • Bing Xiao,
  • Bing Xiao,
  • Silu Meng,
  • Silu Meng,
  • Haiying Sun,
  • Haiying Sun,
  • Xinran Fan,
  • Xinran Fan,
  • Shuang Li,
  • Shuang Li

DOI
https://doi.org/10.3389/fmed.2024.1294230
Journal volume & issue
Vol. 11

Abstract

Read online

IntroductionCervical cancer is the fourth most common cancer among female worldwide. Early detection and intervention are essential. This study aims to construct an early predictive warning model for cervical cancer and precancerous lesions utilizing clinical data and simple nucleotide polymorphisms (SNPs).MethodsClinical data and germline SNPs were collected from 472 participants. Univariate logistic regression, least absolute shrinkage selection operator (LASSO), and stepwise regression were performed to screen variables. Logistic regression (LR), support vector machine (SVM), random forest (RF), decision tree (DT), extreme gradient boosting(XGBoost) and neural network(NN) were applied to establish models. The receiver operating characteristic (ROC) curve was used to compare the models’ efficiencies. The performance of models was validated using decision curve analysis (DCA).ResultsThe LR model, which included 6 SNPs and 2 clinical variables as independent risk factors for cervical carcinogenesis, was ultimately chosen as the most optimal model. The DCA showed that the LR model had a good clinical application.DiscussionThe predictive model effectively foresees cervical cancer risk using clinical and SNP data, aiding in planning timely interventions. It provides a transparent tool for refining clinical decisions in cervical cancer management.

Keywords