Mathematics (Apr 2024)

Communication-Efficient Zeroth-Order Adaptive Optimization for Federated Learning

  • Ping Xie,
  • Xiangrui Gao,
  • Fan Li,
  • Ling Xing,
  • Yu Zhang,
  • Hanxiao Sun

DOI: https://doi.org/10.3390/math12081148
Journal volume & issue: Vol. 12, no. 8, p. 1148

Abstract

Federated learning (FL) has become a prevalent distributed training paradigm in which local devices collaboratively train learning models without exchanging their local data. One of the most widely used FL frameworks is FedAvg, owing to its efficiency and simplicity of implementation; in FedAvg, first-order gradient information is generally used to train the model parameters. In practice, however, gradient information may be unavailable or infeasible to compute in some applications, such as federated black-box optimization problems. To address this issue, we propose a zeroth-order adaptive federated learning algorithm, referred to as ZO-AdaFL, which integrates zeroth-order optimization into the adaptive gradient method and therefore requires no gradient information. We also rigorously analyze the convergence behavior of ZO-AdaFL in the non-convex setting, showing that ZO-AdaFL converges to a region close to a stationary point at a speed of O(1/T), where T denotes the total number of iterations. Finally, to verify the performance of ZO-AdaFL, simulation experiments are performed on the MNIST and FMNIST datasets. Our experimental findings demonstrate that ZO-AdaFL outperforms other state-of-the-art zeroth-order FL approaches in terms of both effectiveness and efficiency.
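
The core idea described in the abstract, replacing true gradients with a function-evaluation-based estimate and feeding that estimate into an adaptive (Adam-style) update, can be illustrated with a minimal single-client Python sketch. This is an illustrative reconstruction, not the authors' published implementation: the function names (zo_gradient_estimate, adaptive_update), the two-point Gaussian-smoothing estimator, the smoothing parameter mu, and all hyperparameter values are assumptions made here for clarity.

    import numpy as np

    def zo_gradient_estimate(f, x, mu=1e-3, num_dirs=10, rng=None):
        # Two-point zeroth-order gradient estimator (assumed estimator, not
        # necessarily the one used in the paper). Approximates grad f(x) with
        #   (1/k) * sum_i [(f(x + mu*u_i) - f(x - mu*u_i)) / (2*mu)] * u_i,
        # where u_i are random Gaussian directions; only f-evaluations are used.
        rng = rng or np.random.default_rng(0)
        g = np.zeros_like(x)
        for _ in range(num_dirs):
            u = rng.standard_normal(x.shape)
            g += (f(x + mu * u) - f(x - mu * u)) / (2 * mu) * u
        return g / num_dirs

    def adaptive_update(x, g, m, v, t, lr=0.01, beta1=0.9, beta2=0.999, eps=1e-8):
        # One Adam-style adaptive step driven by the zeroth-order estimate g.
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g ** 2
        m_hat = m / (1 - beta1 ** t)
        v_hat = v / (1 - beta2 ** t)
        x = x - lr * m_hat / (np.sqrt(v_hat) + eps)
        return x, m, v

    if __name__ == "__main__":
        # Toy usage: minimise a black-box quadratic without ever calling its gradient.
        f = lambda x: np.sum((x - 1.0) ** 2)  # treated as a black box
        x = np.zeros(5)
        m, v = np.zeros_like(x), np.zeros_like(x)
        for t in range(1, 201):
            g = zo_gradient_estimate(f, x, rng=np.random.default_rng(t))
            x, m, v = adaptive_update(x, g, m, v, t)
        print("final loss:", f(x))

In a federated setting, each client would run such zeroth-order steps locally and a server would aggregate the resulting model updates (as in FedAvg); the sketch above only shows the local building blocks.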

Keywords