Applied Sciences (Oct 2023)
Pipelined Stochastic Gradient Descent with Taylor Expansion
Abstract
Stochastic gradient descent (SGD) is an optimization method typically used in deep learning to train deep neural network (DNN) models. In recent studies on DNN training, pipeline parallelism, a type of model parallelism, has been proposed to accelerate SGD training. However, since SGD is inherently sequential, naively implemented pipeline parallelism introduces the weight inconsistency and delayed gradient problems, resulting in reduced training efficiency. In this study, we propose a novel method called TaylorPipe to alleviate these problems. The proposed method generates multiple model replicas to solve the weight inconsistency problem and adopts a Taylor expansion-based gradient prediction algorithm to mitigate the delayed gradient problem. We verified the efficiency of the proposed method using VGG-16 and ResNet-34 on the CIFAR-10 and CIFAR-100 datasets. The experimental results show that the training time is reduced by up to 2.7 times while the accuracy of TaylorPipe remains comparable to that of SGD.
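The following is a minimal sketch of the general idea of Taylor expansion-based gradient prediction for delayed gradients, as summarized in the abstract: a gradient computed against stale weights is corrected toward the current weights using a first-order Taylor expansion. The diagonal Hessian surrogate (lam * g * g), the function name predict_gradient, and all parameter values are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def predict_gradient(stale_grad, stale_weights, current_weights, lam=0.1):
    """Approximate the gradient at current_weights from a gradient computed at
    stale_weights via a first-order Taylor expansion:
        g(w_t) ~= g(w_s) + H (w_t - w_s),
    where the Hessian H is replaced by a cheap diagonal surrogate lam * g * g
    (an assumption for illustration; the paper may use a different approximation)."""
    hessian_diag = lam * stale_grad * stale_grad              # diagonal Hessian surrogate
    return stale_grad + hessian_diag * (current_weights - stale_weights)

# Toy usage: correct a delayed gradient before applying an SGD step.
rng = np.random.default_rng(0)
w_stale = rng.normal(size=8)                  # weights the gradient was computed against
w_now = w_stale + 0.01 * rng.normal(size=8)   # weights after several pipeline steps
g_stale = rng.normal(size=8)                  # delayed gradient arriving from the pipeline

g_pred = predict_gradient(g_stale, w_stale, w_now)
w_next = w_now - 0.05 * g_pred                # SGD update using the predicted gradient
```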
Keywords