Efficient Use of GPU Memory for Large-Scale Deep Learning Model Training

Hyeonseong Choi; Jaehwan Lee

doi:10.3390/app112110377

Applied Sciences (Nov 2021)

Efficient Use of GPU Memory for Large-Scale Deep Learning Model Training

Hyeonseong Choi,
Jaehwan Lee

Affiliations

Hyeonseong Choi: School of Electronics and Information Engineering, Korea Aerospace University, Goyang-si 10540, Korea
Jaehwan Lee: School of Electronics and Information Engineering, Korea Aerospace University, Goyang-si 10540, Korea

DOI: https://doi.org/10.3390/app112110377
Journal volume & issue: Vol. 11, no. 21
p. 10377

Abstract

Read online

To achieve high accuracy when performing deep learning, it is necessary to use a large-scale training model. However, due to the limitations of GPU memory, it is difficult to train large-scale training models within a single GPU. NVIDIA introduced a technology called CUDA Unified Memory with CUDA 6 to overcome the limitations of GPU memory by virtually combining GPU memory and CPU memory. In addition, in CUDA 8, memory advise options are introduced to efficiently utilize CUDA Unified Memory. In this work, we propose a newly optimized scheme based on CUDA Unified Memory to efficiently use GPU memory by applying different memory advise to each data type according to access patterns in deep learning training. We apply CUDA Unified Memory technology to PyTorch to see the performance of large-scale learning models through the expanded GPU memory. We conduct comprehensive experiments on how to efficiently utilize Unified Memory by applying memory advises when performing deep learning. As a result, when the data used for deep learning are divided into three types and a memory advise is applied to the data according to the access pattern, the deep learning execution time is reduced by 9.4% compared to the default Unified Memory.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords