Lazy Aggregation for Heterogeneous Federated Learning

Gang Xu; De-Lun Kong; Xiu-Bo Chen; Xin Liu

doi:10.3390/app12178515

Applied Sciences (Aug 2022)

Lazy Aggregation for Heterogeneous Federated Learning

Gang Xu,
De-Lun Kong,
Xiu-Bo Chen,
Xin Liu

Affiliations

Gang Xu: School of Information Science and Technology, North China University of Technology, Beijing 100144, China
De-Lun Kong: School of Information Science and Technology, North China University of Technology, Beijing 100144, China
Xiu-Bo Chen: Information Security Center, State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
Xin Liu: School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou 014010, China

DOI: https://doi.org/10.3390/app12178515
Journal volume & issue: Vol. 12, no. 17
p. 8515

Abstract

Read online

Federated learning (FL) is a distributed neural network training paradigm with privacy protection. With the premise of ensuring that local data isn’t leaked, multi-device cooperation trains the model and improves its normalization. Unlike centralized training, FL is susceptible to heterogeneous data, biased gradient estimations hinder convergence of the global model, and traditional sampling techniques cannot apply FL due to privacy constraints. Therefore, this paper proposes a novel FL framework, federated lazy aggregation (FedLA), which reduces aggregation frequency to obtain high-quality gradients and improve robustness in non-IID. To judge the aggregating timings, the change rate of the models’ weight divergence (WDR) is introduced to FL. Furthermore, the collected gradients also facilitate FL walking out of the saddle point without extra communications. The cross-device momentum (CDM) mechanism could significantly improve the upper limit performance of the global model in non-IID. We evaluate the performance of several popular algorithms, including FedLA and FedLA with momentum (FedLAM). The results show that FedLAM achieves the best performance in most scenarios and the performance of the global model can also be improved in IID scenarios.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords