Design Space Exploration of a Sparse MobileNetV2 Using High-Level Synthesis and Sparse Matrix Techniques on FPGAs

Antonios Tragoudaras; Pavlos Stoikos; Konstantinos Fanaras; Athanasios Tziouvaras; George Floros; Georgios Dimitriou; Kostas Kolomvatsos; Georgios Stamoulis

doi:10.3390/s22124318

Sensors (Jun 2022)

Design Space Exploration of a Sparse MobileNetV2 Using High-Level Synthesis and Sparse Matrix Techniques on FPGAs

Antonios Tragoudaras,
Pavlos Stoikos,
Konstantinos Fanaras,
Athanasios Tziouvaras,
George Floros,
Georgios Dimitriou,
Kostas Kolomvatsos,
Georgios Stamoulis

Affiliations

Antonios Tragoudaras: Department of Electrical and Computer Engineering, University of Thessaly, 383 34 Volos, Greece
Pavlos Stoikos: Department of Electrical and Computer Engineering, University of Thessaly, 383 34 Volos, Greece
Konstantinos Fanaras: Department of Electrical and Computer Engineering, University of Thessaly, 383 34 Volos, Greece
Athanasios Tziouvaras: Department of Electrical and Computer Engineering, University of Thessaly, 383 34 Volos, Greece
George Floros: Department of Electrical and Computer Engineering, University of Thessaly, 383 34 Volos, Greece
Georgios Dimitriou: Department of Informatics and Telecommunications, University of Thessaly, 35 100 Lamia, Greece
Kostas Kolomvatsos: Department of Informatics and Telecommunications, University of Thessaly, 35 100 Lamia, Greece
Georgios Stamoulis: Department of Electrical and Computer Engineering, University of Thessaly, 383 34 Volos, Greece

DOI: https://doi.org/10.3390/s22124318
Journal volume & issue: Vol. 22, no. 12
p. 4318

Abstract

Read online

Convolution Neural Networks (CNNs) are gaining ground in deep learning and Artificial Intelligence (AI) domains, and they can benefit from rapid prototyping in order to produce efficient and low-power hardware designs. The inference process of a Deep Neural Network (DNN) is considered a computationally intensive process that requires hardware accelerators to operate in real-world scenarios due to the low latency requirements of real-time applications. As a result, High-Level Synthesis (HLS) tools are gaining popularity since they provide attractive ways to reduce design time complexity directly in register transfer level (RTL). In this paper, we implement a MobileNetV2 model using a state-of-the-art HLS tool in order to conduct a design space exploration and to provide insights on complex hardware designs which are tailored for DNN inference. Our goal is to combine design methodologies with sparsification techniques to produce hardware accelerators that achieve comparable error metrics within the same order of magnitude with the corresponding state-of-the-art systems while also significantly reducing the inference latency and resource utilization. Toward this end, we apply sparse matrix techniques on a MobileNetV2 model for efficient data representation, and we evaluate our designs in two different weight pruning approaches. Experimental results are evaluated with respect to the CIFAR-10 data set using several different design methodologies in order to fully explore their effects on the performance of the model under examination.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords