Heliyon (Aug 2024)

Performance enhancement of classifiers through Bio inspired feature selection methods for early detection of lung cancer from microarray genes

  • Karthika M S,
  • Harikumar Rajaguru,
  • Ajin R. Nair

Journal volume & issue
Vol. 10, no. 16
p. e36419

Abstract

Read online

Gene expression in the microarray is assimilated with redundant and high-dimensional information. Moreover, the information in the microarray genes mostly correlates with background noise. This paper uses dimensionality reduction and feature selection methods to employ a classification methodology for high-dimensional lung cancer microarray data. The approach is enforced in two phases; initially, the genes are dimensionally reduced through Hilbert Transform, Detrend Fluctuation Analysis and Least Square Linear Regression methods. The dimensionally reduced data is further optimized in the next phase using Elephant Herd optimization (EHO) and Cuckoo Search Feature selection methods. The classifiers used here are Bayesian Linear Discriminant, Naive Bayes, Random Forest, Decision Tree, SVM (Linear), SVM (Polynomial), and SVM (RBF). The classifier's performances are analysed with and without feature selection methods. The SVM (Linear) classifier with the DFA Dimensionality Reduction method and EHO feature selection achieved the highest accuracy of 92.26 % compared to other classifiers.

Keywords