Mathematics (Aug 2023)
A Derivative-Incorporated Adaptive Gradient Method for Federated Learning
Abstract
Federated learning, an emerging machine learning paradigm, has received increasing attention in recent years; it enables decentralized model training across data silos or edge intelligent devices in the Internet of Things without exchanging local raw data. A variety of algorithms have been proposed to address the challenges of federated learning. However, most of these methods are based on stochastic gradient descent, which suffers from slow convergence and unstable performance during training. In this paper, we propose a differential adaptive federated optimization method, which incorporates an adaptive learning rate and the gradient difference into the iteration rule of the global model. We further adopt first-order moment estimation to approximate the differential term, so as to avoid amplifying random noise from the input data samples. A theoretical convergence guarantee is established for the proposed method in the stochastic non-convex setting, under both full and partial client participation. Experiments on an image classification task are performed on two standard datasets by training a neural network model, and comparisons with several baselines demonstrate the effectiveness of the proposed method.
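The abstract only outlines the update rule, so the following is a minimal sketch of one federated round in which the server combines an Adam-style adaptive learning rate with a first-order-moment estimate of the gradient difference (the differential term). All function names (local_sgd, server_round), hyperparameters, the toy loss, and the exact form of the update are illustrative assumptions, not the paper's actual algorithm.

```python
import numpy as np

# Hypothetical sketch: one federated round with an adaptive,
# derivative-incorporated server update. Constants and the update
# rule are assumptions, not the paper's published method.

def local_sgd(x_global, data, lr=0.05, steps=5):
    """Hypothetical client: a few SGD steps on the toy loss
    0.5 * ||x - data||^2, returning the local model delta."""
    x = x_global.copy()
    for _ in range(steps):
        grad = x - data          # gradient of the toy quadratic loss
        x -= lr * grad
    return x_global - x          # the delta acts as a pseudo-gradient

def server_round(x, deltas, state, beta1=0.9, beta2=0.99,
                 eta=0.1, gamma=0.1, eps=1e-3):
    """Global step: adaptive learning rate plus a first-order-moment
    estimate of the gradient difference, which smooths the stochastic
    pseudo-gradient so the finite difference below does not amplify
    sampling noise."""
    g = np.mean(deltas, axis=0)                   # aggregated pseudo-gradient
    m = beta1 * state["m"] + (1 - beta1) * g      # first moment (smoothed grad)
    v = beta2 * state["v"] + (1 - beta2) * g**2   # second moment
    diff = m - state["m_prev"]                    # smoothed gradient difference
    x = x - eta * (m + gamma * diff) / (np.sqrt(v) + eps)
    return x, {"m": m, "v": v, "m_prev": m}

# Toy run: two clients whose local optima straddle the global optimum.
x = np.zeros(3)
state = {"m": np.zeros(3), "v": np.zeros(3), "m_prev": np.zeros(3)}
clients = [np.array([1.0, 2.0, 3.0]), np.array([3.0, 2.0, 1.0])]
for _ in range(50):
    deltas = [local_sgd(x, d) for d in clients]
    x, state = server_round(x, deltas, state)
print(x)  # approaches the average of the client optima, [2, 2, 2]
```

In this sketch the second-moment normalization supplies the adaptive learning rate, while the gamma-weighted diff term plays the role of the gradient difference described in the abstract; computing it from the smoothed first moment, rather than from raw stochastic gradients, is what keeps the differential term from magnifying sample noise.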
Keywords