Mathematics (Feb 2023)

Automated CNN Architectural Design: A Simple and Efficient Methodology for Computer Vision Tasks

  • Ali Al Bataineh,
  • Devinder Kaur,
  • Mahmood Al-khassaweneh,
  • Esraa Al-sharoa

DOI
https://doi.org/10.3390/math11051141
Journal volume & issue
Vol. 11, no. 5
p. 1141

Abstract

Read online

Convolutional neural networks (CNN) have transformed the field of computer vision by enabling the automatic extraction of features, obviating the need for manual feature engineering. Despite their success, identifying an optimal architecture for a particular task can be a time-consuming and challenging process due to the vast space of possible network designs. To address this, we propose a novel neural architecture search (NAS) framework that utilizes the clonal selection algorithm (CSA) to automatically design high-quality CNN architectures for image classification problems. Our approach uses an integer vector representation to encode CNN architectures and hyperparameters, combined with a truncated Gaussian mutation scheme that enables efficient exploration of the search space. We evaluated the proposed method on six challenging EMNIST benchmark datasets for handwritten digit recognition, and our results demonstrate that it outperforms nearly all existing approaches. In addition, our approach produces state-of-the-art performance while having fewer trainable parameters than other methods, making it low-cost, simple, and reusable for application to multiple datasets.

Keywords