Frontiers in Immunology (Sep 2022)

Predictive mutation signature of immunotherapy benefits in NSCLC based on machine learning algorithms

  • Zhichao Liu,
  • Zhichao Liu,
  • Guo Lin,
  • Guo Lin,
  • Zeping Yan,
  • Linduo Li,
  • Xingchen Wu,
  • Jingrong Shi,
  • Jianxing He,
  • Lei Zhao,
  • Hengrui Liang,
  • Wei Wang

DOI
https://doi.org/10.3389/fimmu.2022.989275
Journal volume & issue
Vol. 13

Abstract

Read online

BackgroundDeveloping prediction tools for immunotherapy approaches is a clinically important and rapidly emerging field. The routinely used prediction biomarker is inaccurate and may not adequately utilize large amounts of medical data. Machine learning is a promising way to predict the benefit of immunotherapy from individual data by individuating the most important features from genomic data and clinical characteristics.MethodsMachine learning was applied to identify a list of candidate genes that may predict immunotherapy benefits using data from the published cohort of 853 patients with NSCLC. We used XGBoost to capture nonlinear relations among many mutation genes and ICI benefits. The value of the derived machine learning-based mutation signature (ML-signature) on immunotherapy efficacy was evaluated and compared with the tumor mutational burden (TMB) and other clinical characteristics. The predictive power of ML-signature was also evaluated in independent cohorts of patients with NSCLC treated with ICI.ResultsWe constructed the ML-signature based on 429 (training/validation = 8/2) patients who received immunotherapy and extracted 88 eligible predictive genes. Additionally, we conducted internal and external validation with the utility of the OAK+POPLAR dataset and independent cohorts, respectively. This ML-signature showed the enrichment in immune-related signaling pathways and compared to TMB, ML-signature was equipped with favorable predictive value and stratification.ConclusionPrevious studies proposed no predictive difference between original TMB and modified TMB, and original TMB contains some genes with no predictive value. To demonstrate that fewer genetic tests are sufficient to predict immunotherapy efficacy, we used machine learning to screen out gene panels, which are used to calculate TMB. Therefore, we obtained the 88-gene panel, which showed the favorable prediction performance and stratification effect compared to the original TMB.

Keywords