International Journal of Data and Network Science (Jan 2024)

Multi-objective of wind-driven optimization as feature selection and clustering to enhance text clustering

  • Mehdi G. Duaimi,
  • Qusay Bsoul,
  • Abbas F. J. AL-Gburi

DOI
https://doi.org/10.5267/j.ijdns.2024.1.014
Journal volume & issue
Vol. 8, no. 3
pp. 1985 – 1998

Abstract

Read online

Text Clustering consists of grouping objects of similar categories. The initial centroids influence operation of the system with the potential to become trapped in local optima. The second issue pertains to the impact of a huge number of features on the determination of optimal initial centroids. The problem of dimensionality may be reduced by feature selection. Therefore, Wind Driven Optimization (WDO) was employed as Feature Selection to reduce the unimportant words from the text. In addition, the current study has integrated a novel clustering optimization technique called the WDO (Wasp Swarm Optimization) to effectively determine the most suitable initial centroids. The result showed the new meta-heuristic which is WDO was employed as the multi-objective first time as unsupervised Feature Selection (WDOFS) and the second time as a Clustering algorithm (WDOC). For example, the WDOC outperformed Harmony Search and Particle Swarm in terms of F-measurement by 93.3%; in contrast, text clustering's performance improves 0.9% because of using suggested clustering on the proposed feature selection. With WDOFS more than 50 percent of features have been removed from the other examination of features. The best result got the multi-objectives with F-measurement 98.3%.