Algorithms (Dec 2020)

Nature-Inspired Optimization Algorithms for Text Document Clustering—A Comprehensive Analysis

  • Laith Abualigah,
  • Amir H. Gandomi,
  • Mohamed Abd Elaziz,
  • Abdelazim G. Hussien,
  • Ahmad M. Khasawneh,
  • Mohammad Alshinwan,
  • Essam H. Houssein

DOI
https://doi.org/10.3390/a13120345
Journal volume & issue
Vol. 13, no. 12
p. 345

Abstract

Read online

Text clustering is one of the efficient unsupervised learning techniques used to partition a huge number of text documents into a subset of clusters. In which, each cluster contains similar documents and the clusters contain dissimilar text documents. Nature-inspired optimization algorithms have been successfully used to solve various optimization problems, including text document clustering problems. In this paper, a comprehensive review is presented to show the most related nature-inspired algorithms that have been used in solving the text clustering problem. Moreover, comprehensive experiments are conducted and analyzed to show the performance of the common well-know nature-inspired optimization algorithms in solving the text document clustering problems including Harmony Search (HS) Algorithm, Genetic Algorithm (GA), Particle Swarm Optimization (PSO) Algorithm, Ant Colony Optimization (ACO), Krill Herd Algorithm (KHA), Cuckoo Search (CS) Algorithm, Gray Wolf Optimizer (GWO), and Bat-inspired Algorithm (BA). Seven text benchmark datasets are used to validate the performance of the tested algorithms. The results showed that the performance of the well-known nurture-inspired optimization algorithms almost the same with slight differences. For improvement purposes, new modified versions of the tested algorithms can be proposed and tested to tackle the text clustering problems.

Keywords