Optimization of Convolutional Neural Networks Architectures Using PSO for Sign Language Recognition

Jonathan Fregoso; Claudia I. Gonzalez; Gabriela E. Martinez

doi:10.3390/axioms10030139

Axioms (Jun 2021)

Optimization of Convolutional Neural Networks Architectures Using PSO for Sign Language Recognition

Jonathan Fregoso,
Claudia I. Gonzalez,
Gabriela E. Martinez

Affiliations

Jonathan Fregoso: Division of Graduate Studies and Research, Tijuana Institute of Technology, Tijuana 22414, Mexico
Claudia I. Gonzalez: Division of Graduate Studies and Research, Tijuana Institute of Technology, Tijuana 22414, Mexico
Gabriela E. Martinez: Division of Graduate Studies and Research, Tijuana Institute of Technology, Tijuana 22414, Mexico

DOI: https://doi.org/10.3390/axioms10030139
Journal volume & issue: Vol. 10, no. 3
p. 139

Abstract

Read online

This paper presents an approach to design convolutional neural network architectures, using the particle swarm optimization algorithm. The adjustment of the hyper-parameters and finding the optimal network architecture of convolutional neural networks represents an important challenge. Network performance and achieving efficient learning models for a particular problem depends on setting hyper-parameter values and this implies exploring a huge and complex search space. The use of heuristic-based searches supports these types of problems; therefore, the main contribution of this research work is to apply the PSO algorithm to find the optimal parameters of the convolutional neural networks which include the number of convolutional layers, the filter size used in the convolutional process, the number of convolutional filters, and the batch size. This work describes two optimization approaches; the first, the parameters obtained by PSO are kept under the same conditions in each convolutional layer, and the objective function evaluated by PSO is given by the classification rate; in the second, the PSO generates different parameters per layer, and the objective function is composed of the recognition rate in conjunction with the Akaike information criterion, the latter helps to find the best network performance but with the minimum parameters. The optimized architectures are implemented in three study cases of sign language databases, in which are included the Mexican Sign Language alphabet, the American Sign Language MNIST, and the American Sign Language alphabet. According to the results, the proposed methodologies achieved favorable results with a recognition rate higher than 99%, showing competitive results compared to other state-of-the-art approaches.

Published in Axioms

ISSN: 2075-1680 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/axioms

About the journal

Abstract

Keywords