Journal of King Saud University: Computer and Information Sciences (Sep 2022)

Identification of significant bio-markers from high-dimensional cancerous data employing a modified multi-objective meta-heuristic algorithm

  • Prajna Paramita Debata,
  • Puspanjali Mohapatra

Journal volume & issue
Vol. 34, no. 8
pp. 4743 – 4755

Abstract

Read online

Identification of the most prominent bio-markers or genes with high classification accuracy in the high-dimensional cancerous data, is still remained as an emerging challenge for the machine learning researchers. As this challenge has two objectives, i.e. minimizing the number of genes (NoG) and maximizing the classification accuracy percentage (CAP), this problem can be modeled as binary multi-objective approach. In this work, a modified version of multi-objective Jaya algorithm, multi-objective chaotic Jaya (MOCJaya), is suggested to select the minimum NoG with high CAP. Initially, a filter approach, namely Fisher score is applied to pre-select the informative genes. Then, MOCJaya algorithm is employed for both selecting key genes and classifying the cancer data. To assess the efficacy of the designed algorithm, ten binary and multi-class cancerous datasets are considered. Here, the suggested algorithm has been compared with multi-objective chaotic Genetic Algorithm (MOCGA), multi-objective chaotic particle swarm optimization (MOCPSO), multi-objective Jaya (MOJaya), multi-objective PSO (MOPSO), and non-dominated sorting GA (NSGA-II) models. Moreover, a comparison of MOCJaya algorithm with other seventeen existing models, is also performed here. The experimental results and comparison analysis reveal that MOCJaya classifies both the positive and negative samples of the cancer datasets in high CAP with a smaller NoG.

Keywords