Scientific African (Dec 2024)

Gene prioritization-based active bio-module identification for bioinformatics

  • Mukesh Soni,
  • Mohammed Wasim Bhatt,
  • Evans Asenso,
  • Dr. Mir Omar Jhon

Journal volume & issue
Vol. 26
p. e02466

Abstract

Read online

Massive multi-omics data are being used to research cancer pathogenesis at the molecular level as high-throughput sequencing technology advances. Many present approaches frequently fail to detect strongly coupled modules that are intimately associated with cancer. By combining two forms of omics data, a technique to active bio-module identification known as IdeMod is proposed, which employs gene expression and protein-protein interaction networks. IdeMod is a p-step random walk kernel regression model-based gene activity score algorithm that uses the Pareto optimum consensus (POC) method's dominance connections to generate a prioritised list of genes. IdeMod uses the SA GPROX simulated annealing technique to identify the PPI network's most linked and high-priority bio-modules. The techniques RegMod, LEAN, SigMod, ModFinder, and IdeMod were experimentally tested on real-world cervical and BRCA datasets. These findings show that the IdeMod algorithm may identify a densely linked module containing multiple genes that either promote or hinder tumour growth. The BRCA1 gene increases the likelihood of developing hereditary breast cancer associated with BRCA mutations. As a result, the IdeMod technique can be used in conjunction with other tools to detect bio-modules.

Keywords