IEEE Access (Jan 2024)
SWOSBC: A Novel Optimizer for Learning Convolutional Neural Networks
Abstract
Optimization algorithms are used to train Deep Neural Networks (DNNs) with the aim of maximizing accuracy and minimizing loss, and the design of efficient optimization techniques remains a significant field of research. Most adaptive optimizers, including Adam and diffGrad, are unable to address the noisy updates and zigzag behavior introduced during optimization. Moreover, Adam overfits a model in certain situations, especially when the training dataset is small, which ultimately leads to poor generalization on test data. To overcome these shortcomings, an optimization method, SWOSBC, is proposed that uses the square root of the exponentially weighted average and regulates the step size without applying the second bias correction. SWOSBC computes the second-order moment from both the first and second decay rates instead of the second decay rate alone, and uses this second momentum, rather than the bias-corrected second moment, as the denominator. Through an adaptive term based on the exponentially weighted average, the proposed SWOSBC yields a smoother optimization trajectory and higher image Classification Accuracy (CA). Comprehensive experiments on standard datasets (CIFAR-10, CIFAR-100, MNIST, and ImageNet) show that SWOSBC outperforms state-of-the-art techniques: it achieves the best CA on CIFAR-10 and MNIST for every tested network, on CIFAR-100 for the majority of the examined network models, and on ImageNet with the ResNet-18 network. Convergence experiments on the Rosenbrock function and on linear regression further illustrate how smoothly and quickly SWOSBC reaches the global minimum. Source code link: https://github.com/UtpalNandi/SWOSBC.
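The abstract describes the update rule only in words; the exact formulas appear in the body of the paper. Purely as an illustration of the kind of Adam-style variant described above, the following is a minimal PyTorch sketch under one plausible reading: the squared gradient is accumulated using both decay rates, only the first moment is bias-corrected, and the square root of the raw second moment serves as the denominator. The class name SWOSBCSketch, the hyperparameter defaults, and the precise way the first decay rate enters the second moment are illustrative assumptions, not the authors' exact algorithm.

    # Minimal sketch of an SWOSBC-style optimizer in PyTorch.
    # NOTE: the exact update rule is defined in the paper, not in this abstract;
    # the way beta1 enters the second moment below is an illustrative assumption.
    import torch
    from torch.optim import Optimizer


    class SWOSBCSketch(Optimizer):
        def __init__(self, params, lr=1e-3, betas=(0.9, 0.999), eps=1e-8):
            defaults = dict(lr=lr, betas=betas, eps=eps)
            super().__init__(params, defaults)

        @torch.no_grad()
        def step(self, closure=None):
            loss = closure() if closure is not None else None
            for group in self.param_groups:
                beta1, beta2 = group["betas"]
                for p in group["params"]:
                    if p.grad is None:
                        continue
                    g = p.grad
                    state = self.state[p]
                    if len(state) == 0:
                        state["step"] = 0
                        state["m"] = torch.zeros_like(p)  # first moment
                        state["v"] = torch.zeros_like(p)  # second moment
                    state["step"] += 1
                    m, v = state["m"], state["v"]

                    # First moment: standard exponentially weighted average.
                    m.mul_(beta1).add_(g, alpha=1 - beta1)
                    # Second moment built from BOTH decay rates (assumed form):
                    # the squared gradient is scaled by (1 - beta1) * (1 - beta2).
                    v.mul_(beta2).add_(g * g, alpha=(1 - beta1) * (1 - beta2))

                    # Bias-correct only the first moment; the square root of the
                    # raw second moment is the denominator (no second bias correction).
                    m_hat = m / (1 - beta1 ** state["step"])
                    denom = v.sqrt().add_(group["eps"])
                    p.addcdiv_(m_hat, denom, value=-group["lr"])
            return loss

Such a sketch would be used like any torch.optim optimizer, e.g. opt = SWOSBCSketch(model.parameters(), lr=1e-3) inside the usual zero_grad/backward/step training loop.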
Keywords