IEEE Access (Jan 2021)

Blackthorn: Latency Estimation Framework for CNNs on Embedded Nvidia Platforms

  • Martin Lechner
  • Axel Jantsch

DOI: https://doi.org/10.1109/ACCESS.2021.3101936
Journal volume & issue: Vol. 9, pp. 110074–110084

Abstract


With increasingly powerful yet efficient embedded devices and accelerators available for Deep Neural Networks (DNNs), machine learning is becoming an integral part of edge computing. As the number of such devices increases, finding the best platform for a specific application becomes more challenging. A common challenge for application developers is finding the most cost-effective combination of a DNN and a device that still meets latency and accuracy requirements. In this work, we propose Blackthorn, a layer-wise latency estimation framework for embedded Nvidia GPUs based on analytical models. We provide accurate predictions for each layer, helping developers to find bottlenecks and optimize the architecture of a DNN to fit target platforms. Our framework can quickly evaluate and compare large numbers of network optimizations without needing to build time-consuming execution engines. Our experimental results on Jetson TX2 and Jetson Nano devices show per-layer estimation errors of 6.104% and 5.888% Root-Mean-Square-Percentage-Error (RMSPE), respectively, significantly outperforming current state-of-the-art methods. At the network level, the average latency error is below 3% for the tested DNNs.
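To illustrate how the reported error figures can be reproduced, the sketch below computes the per-layer RMSPE and a network-level latency error from measured and predicted per-layer latencies. It uses the standard RMSPE definition; the latency values and variable names are invented for the example and are not taken from the paper or the Blackthorn implementation.

    import numpy as np

    def rmspe(measured, predicted):
        # Root-Mean-Square-Percentage-Error in percent:
        # 100 * sqrt(mean(((predicted - measured) / measured)^2))
        measured = np.asarray(measured, dtype=float)
        predicted = np.asarray(predicted, dtype=float)
        rel_err = (predicted - measured) / measured
        return 100.0 * np.sqrt(np.mean(rel_err ** 2))

    # Hypothetical per-layer latencies in milliseconds (illustrative only).
    measured_ms = [0.42, 1.31, 0.87, 2.05]
    predicted_ms = [0.45, 1.25, 0.90, 1.98]

    print(f"Per-layer RMSPE: {rmspe(measured_ms, predicted_ms):.3f}%")

    # Network-level latency error: compare the summed per-layer latencies.
    total_error = abs(sum(predicted_ms) - sum(measured_ms)) / sum(measured_ms)
    print(f"Network-level latency error: {100.0 * total_error:.2f}%")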

Keywords