GPU Static Modeling Using PTX and Deep Structured Learning

Joao Guerreiro; Aleksandar Ilic; Nuno Roma; Pedro Tomas

doi:10.1109/ACCESS.2019.2951218

IEEE Access (Jan 2019)

GPU Static Modeling Using PTX and Deep Structured Learning

Joao Guerreiro,
Aleksandar Ilic,
Nuno Roma,
Pedro Tomas

Affiliations

Joao Guerreiro: ORCiD; INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal
Aleksandar Ilic: ORCiD; INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal
Nuno Roma: ORCiD; INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal
Pedro Tomas: ORCiD; INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal

DOI: https://doi.org/10.1109/ACCESS.2019.2951218
Journal volume & issue: Vol. 7
pp. 159150 – 159161

Abstract

Read online

In the quest for exascale computing, energy-efficiency is a fundamental goal in high-performance computing systems, typically achieved via dynamic voltage and frequency scaling (DVFS). However, this type of mechanism relies on having accurate methods of predicting the performance and power/energy consumption of such systems. Unlike previous works in the literature, this research focuses on creating novel GPU predictive models that do not require run-time information from the applications. The proposed models, implemented using recurrent neural networks, take into account the sequence of GPU assembly instructions (PTX) and can accurately predict changes in the execution time, power and energy consumption of applications when the frequencies of different GPU domains (core and memory) are scaled. Validated with 24 applications on GPUs from different NVIDIA microarchitectures (Turing, Volta, Pascal and Maxwell), the proposed models attain a significant accuracy. Particularly, the obtained power consumption scaling model provides an average error rate of 7.9% (Tesla T4), 6.7% (Titan V), 5.9% (Titan Xp) and 5.4% (GTX Titan X), which is comparable to state-of-the-art run-time counter-based models. When using the models to select the minimum-energy frequency configuration, significant energy savings can be attained: 8.0% (Tesla T4), 6.0% (Titan V), 29.0% (Titan Xp) and 11.5% (GTX Titan X).

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords