Scientific Reports (Jul 2024)

Comprehensive application of AI algorithms with TCR NGS data for glioma diagnosis

  • Kaiyue Zhou,
  • Zhengliang Xiao,
  • Qi Liu,
  • Xu Wang,
  • Jiaxin Huo,
  • Xiaoqi Wu,
  • Xiaoxiao Zhao,
  • Xiaohan Feng,
  • Baoyi Fu,
  • Pengfei Xu,
  • Yunyun Deng,
  • Wenwen Xiao,
  • Tao Sun,
  • Lin Da

DOI
https://doi.org/10.1038/s41598-024-65305-9
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 12

Abstract

Read online

Abstract T-cell receptor (TCR) detection can examine the extent of T-cell immune responses. Therefore, the article analyzed characteristic data of glioma obtained by DNA-based TCR high-throughput sequencing, to predict the disease with fewer biomarkers and higher accuracy. We downloaded data online and obtained six TCR-related diversity indices to establish a multidimensional classification system. By comparing actual presence of the 602 correlated sequences, we obtained two-dimensional and multidimensional datasets. Multiple classification methods were utilized for both datasets with the classification accuracy of multidimensional data slightly less to two-dimensional datasets. This study reduced the TCR β sequences through feature selection methods like RFECV (Recursive Feature Elimination with Cross-Validation). Consequently, using only the presence of these three sequences, the classification AUC value of 96.67% can be achieved. The combination of the three correlated TCR clones obtained at a source data threshold of 0.1 is: CASSLGGNTEAFF_TRBV12_TRBJ1-1, CASSYSDTGELFF_TRBV6_TRBJ2-2, and CASSLTGNTEAFF_TRBV12_TRBJ1-1. At 0.001, the combination is: CASSLGETQYF_TRBV12_TRBJ2-5, CASSLGGNQPQHF_TRBV12_TRBJ1-5, and CASSLSGNTIYF_TRBV12_TRBJ1-3. This method can serve as a potential diagnostic and therapeutic tool, facilitating diagnosis and treatment of glioma and other cancers.

Keywords