IEEE Access (Jan 2023)
Distributed Stochastic Gradient Descent With Compressed and Skipped Communication
Abstract
This paper introduces CompSkipDSGD, a distributed stochastic gradient descent algorithm that improves communication efficiency by compressing gradients and selectively skipping communication. In addition to compression, CompSkipDSGD allows both the workers and the server to skip communication in any iteration of training and defer it to future iterations without significantly decreasing test accuracy. Experiments on the large-scale ImageNet dataset show that CompSkipDSGD saves hundreds of gigabytes of communication while maintaining accuracy comparable to state-of-the-art algorithms. These experimental results are supported by a theoretical analysis that establishes the convergence of CompSkipDSGD under standard assumptions. Overall, CompSkipDSGD could be useful for reducing communication costs in distributed deep learning and for enabling the use of large-scale datasets and models in complex environments.
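To make the mechanism described above concrete, the following minimal sketch illustrates the general idea of combining gradient compression with skipped communication that is carried forward locally. The top-k compressor, the norm-based skip rule, and all names and thresholds below are assumptions chosen for illustration; they are not the specific operators or skip criteria defined by CompSkipDSGD in the paper.

```python
# Illustrative sketch only: the compression operator, the skip rule, and the
# parameter names are assumptions, not the paper's CompSkipDSGD specification.
import numpy as np

def topk_compress(grad, k):
    """Keep the k largest-magnitude entries of the gradient; zero out the rest."""
    idx = np.argpartition(np.abs(grad), -k)[-k:]
    compressed = np.zeros_like(grad)
    compressed[idx] = grad[idx]
    return compressed

class SkippingWorker:
    """A worker that may skip sending its compressed update in a given
    iteration and accumulate it locally for a future iteration instead."""
    def __init__(self, dim, k, skip_threshold):
        self.residual = np.zeros(dim)          # locally reserved (unsent) update
        self.k = k
        self.skip_threshold = skip_threshold   # assumed skip criterion

    def step(self, grad):
        # Fold the new gradient into whatever was held back earlier.
        self.residual += grad
        update = topk_compress(self.residual, self.k)
        # Assumed skip rule: communicate only if the compressed update is large enough.
        if np.linalg.norm(update) < self.skip_threshold:
            return None                        # skip this round; keep accumulating
        self.residual -= update                # retain only the uncompressed remainder
        return update                          # send this to the server

# Toy usage: the server averages whatever updates arrive in this round.
workers = [SkippingWorker(dim=10, k=3, skip_threshold=0.5) for _ in range(4)]
grads = [np.random.randn(10) * 0.3 for _ in workers]
received = [u for u in (w.step(g) for w, g in zip(workers, grads)) if u is not None]
if received:
    aggregate = np.mean(received, axis=0)
```

In this sketch, skipped updates are not discarded: they remain in the worker's residual and are folded into a later communication round, which is the intuition behind reserving communication for future iterations.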
Keywords