Scientific Reports (Nov 2022)
Identification of key biomarkers for STAD using filter feature selection approaches
Abstract
Abstract Gastric cancer (GC) is the fifth most common cancer and the third leading cause of cancer death worldwide. Discovery of diagnostic biomarkers prompts the early detection of GC. In this study, we used limma method combined with joint mutual information (JMI), a machine learning algorithm, to identify a signature of 11 genes that performed well in distinguishing tumor and normal samples in a stomach adenocarcinoma cohort. Other two GC datasets were used to validate the classifying performances. Several of the candidate genes were correlated with GC tumor progression and survival. Overall, we highlight the application of feature selection approaches in the analysis of high-dimensional biological data, which will improve study accuracies and reduce workloads for the researchers when identifying potential tumor biomarkers.