Interplay of machine learning and bioinformatics approaches to identify genetic biomarkers that affect survival of patients with glioblastoma

Nitun Kumar Podder; Humayan Kabir Rana; Arpa Kar Puza; Md Imam Hasan; Shudeb Babu Sen Omit; Pintu Chandra Shill; Md Abdur Rahim; Rittika Shamsuddin; Bidhan Chandra Podder; Md Habibur Rahman

Informatics in Medicine Unlocked (Jan 2024)

Affiliations

Nitun Kumar Podder: Department of Computer Science and Engineering, Pabna University of Science and Technology, Pabna, Bangladesh; Department of Computer Science and Engineering, Khulna University of Engineering & Technology, Khulna, Bangladesh
Humayan Kabir Rana: Department of Computer Science and Engineering, Green University of Bangladesh, Narayanganj 1461, Dhaka, Bangladesh
Arpa Kar Puza: Department of Computer Science and Engineering, Pabna University of Science and Technology, Pabna, Bangladesh
Md Imam Hasan: Department of Computer Science and Engineering, Green University of Bangladesh, Narayanganj 1461, Dhaka, Bangladesh
Shudeb Babu Sen Omit: Institute of Information Technology, Noakhali Science and Technology University, Noakhali, Bangladesh
Pintu Chandra Shill: Department of Computer Science and Engineering, Khulna University of Engineering & Technology, Khulna, Bangladesh
Md Abdur Rahim: Department of Computer Science and Engineering, Pabna University of Science and Technology, Pabna, Bangladesh
Rittika Shamsuddin: Department of Computer Science, Oklahoma State University, USA
Bidhan Chandra Podder: Dept. of Pediatrics, Institute of Child and Mother Health, Dhaka, Bangladesh
Md Habibur Rahman: Department of Computer Science and Engineering, Islamic University, Kushtia 7003, Bangladesh; Corresponding author.

Journal volume & issue: Vol. 47, p. 101505

Abstract

Glioblastoma, also known as grade IV astrocytoma, is an aggressive, rapidly progressing brain tumor with an estimated median survival of 12 to 18 months. Patients with glioblastoma are at high risk of developing comorbidities such as leukemia, atherosclerosis, autism, sudden cardiac death, and pancreatic neoplasms. Identifying influential biomarker genes is crucial for diagnosing cancer and designing therapeutic targets. To this end, we used The Cancer Genome Atlas (TCGA) dataset to identify the significant genes of glioblastoma. We pre-processed the dataset and applied the Kruskal-Wallis test with Bonferroni correction to select significant biomarker genes. A total of 26 significantly dysregulated genes were identified from 16,261 genes, of which 19 are up-regulated and 7 are down-regulated. We analyzed functional and ontological pathways, protein-protein interactions (PPI), and protein-drug interactions (PDI) to predict the functions of these influential genes. Comorbidities were validated using gold benchmark databases. Furthermore, the Cox proportional hazards model and the product-limit (PL) estimator were used to examine the influence of clinical and genetic variables that play an important role in the survival of glioblastoma patients. This study provides a basis for identifying cancer-influencing genes and for understanding the impact of glioblastoma on the progression of comorbidities.
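
As an illustration of the product-limit (PL) estimator mentioned in the abstract, commonly called the Kaplan-Meier estimator, the following is a minimal sketch in plain Python. It is not the authors' implementation: the function name `product_limit` and the toy follow-up data are hypothetical and are not taken from TCGA. At each time with at least one observed death, the running survival probability is multiplied by (1 - deaths / number at risk); censored patients leave the risk set without lowering the estimate.

```python
from collections import Counter


def product_limit(durations, events):
    """Product-limit (Kaplan-Meier) survival estimate.

    durations: follow-up time for each patient.
    events:    1 if death was observed at that time, 0 if censored.
    Returns a list of (time, survival_probability) pairs, one per
    distinct time at which at least one death was observed.
    """
    if len(durations) != len(events):
        raise ValueError("durations and events must have the same length")

    deaths = Counter(t for t, e in zip(durations, events) if e)
    exits = Counter(durations)  # deaths plus censorings at each time

    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = deaths.get(t, 0)
        if d:
            # Conditional probability of surviving past time t.
            survival *= 1 - d / at_risk
            curve.append((t, survival))
        # Everyone who died or was censored at t leaves the risk set.
        at_risk -= exits[t]
    return curve


# Hypothetical toy cohort: months of follow-up and event indicators.
months = [3, 6, 6, 9, 12, 15, 18, 24]
died = [1, 1, 0, 1, 1, 0, 1, 0]

for t, s in product_limit(months, died):
    print(f"month {t:>2}: estimated survival {s:.3f}")
```

In practice such curves are usually computed per group (for example, high versus low expression of a candidate gene) and compared, while the Cox model is used to estimate the effect of several covariates at once.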

Published in Informatics in Medicine Unlocked

ISSN: 2352-9148 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.journals.elsevier.com/informatics-in-medicine-unlocked/
