IEEE Access (Jan 2025)
Fine Tuning Hyperparameters of Deep Learning Models Using Metaheuristic Accelerated Particle Swarm Optimization Algorithm
Abstract
In recent years, Convolutional Neural Networks (CNNs) have emerged as powerful tools for solving complex real-world problems, particularly in the domain of image processing. The success of CNNs can be attributed to their ability to learn hierarchical representations from data. However, achieving optimal performance with CNNs often necessitates fine-tuning a myriad of hyperparameters, such as learning rates, batch sizes, and network architectures. This tuning process typically relies on expert judgment and can be time-consuming and resource-intensive. To address this challenge, a novel metaheuristic-based optimization approach called Accelerated Particle Swarm Optimization for CNNs (APSO-CNN) is proposed. APSO-CNN uses the global search capabilities of Particle Swarm Optimization (PSO) to automatically optimize hyperparameter configurations for architecture-determined Convolutional Neural Networks (CNNs). The main contribution of this work is the development of the APSO-CNN framework, which introduces an improved PSO variant specifically tailored for Convolutional Neural Networks (CNNs) hyperparameter tuning, thereby reducing manual intervention and significantly enhancing model performance. The proposed APSO-CNN is evaluated across various Convolutional Neural Networks (CNNs) architectures and the Faces94 dataset, representing a diverse range of image processing tasks. Experiments demonstrate that the proposed automated hyperparameter fine-tuning approach consistently yields significant improvements in performance metrics, including Accuracy, Precision, Recall, and F1 Score. The APSO-CNN framework was experimentally validated on the Faces94 dataset comprising 3,060 facial images. These enhancements underscore the effectiveness of APSO-CNN in optimizing Convolutional Neural Networks (CNNs) for image processing applications. Accelerated Particle Swarm Optimization for CNNs (APSO-CNN) presents a promising solution to the challenge of hyperparameter optimization in Convolutional Neural Networks (CNNs). By automating this critical aspect of model development, Accelerated Particle Swarm Optimization for CNNs (APSO-CNN), streamlines the process, making it more efficient and accessible to researchers and practitioners. This contribution has the potential to accelerate advancements in image processing and related fields by enabling the rapid development of high-performance Convolutional Neural Networks (CNNs) models.
Keywords