Delay-Aware Online Resource Allocation for Buffer-Aided Synchronous Federated Learning Over Wireless Networks

Jing Liu; Jinke Zheng; Jing Zhang; Lin Xiang; Derrick Wing Kwan Ng; Xiaohu Ge

doi:10.1109/ACCESS.2024.3489657

IEEE Access (Jan 2024)

Delay-Aware Online Resource Allocation for Buffer-Aided Synchronous Federated Learning Over Wireless Networks

Jing Liu,
Jinke Zheng,
Jing Zhang,
Lin Xiang,
Derrick Wing Kwan Ng,
Xiaohu Ge

Affiliations

Jing Liu: School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, China
Jinke Zheng: ORCiD; School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, China
Jing Zhang: ORCiD; School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, China
Lin Xiang: ORCiD; Communications Engineering Laboratory, Technische Universität Darmstadt, Darmstadt, Germany
Derrick Wing Kwan Ng: ORCiD; School of Electrical Engineering and Telecommunications, University of New South Wales, Sydney, NSW, Australia
Xiaohu Ge: ORCiD; School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, China

DOI: https://doi.org/10.1109/ACCESS.2024.3489657
Journal volume & issue: Vol. 12
pp. 164862 – 164877

Abstract

Read online

Synchronous federated learning (FL) over wireless networks often suffers from the straggler effect, when the time required for training local models and uploading trained parameters varies significantly across heterogeneous wireless devices. This disparity prolongs the duration needed for model aggregation at the data center and slows down the convergence of synchronous FL, posing a significant challenge for FL over wireless networks. In this paper, we propose a novel buffer-aided FL scheme to mitigate the straggler effect. A buffer with sufficiently large storage is deployed at each wireless device to temporarily store the collected training data and adaptively outputs it during local training, according to the computational capabilities and communication data rates of the wireless devices. Consequently, all local models can be synchronously aggregated at the data center to reduce the number of rounds required for model aggregation in FL. To ensure timely information updates, a staleness function is further introduced to characterize the freshness of the data used to train local models. Additionally, the entropic value-at-risk (EVaR) of the data queues is introduced to eliminate the impact of discarded data at the buffers and improve the accuracy of trained local models. We formulate a delay-aware online stochastic optimization problem to minimize the long-term average staleness of all wireless devices for buffer-aided FL. Our problem formulation simultaneously guarantees the stability of data queues at the wireless devices and reduces the risk of data loss. By employing the Lyapunov optimization technique, we transform the problem into instantaneous deterministic optimization subproblems and further solve each subproblem online via utilizing its hidden convexity. Simulation results demonstrate that the proposed buffer-aided synchronous FL scheme can effectively improve the convergence rate of FL and, at the same time, ensure timely synchronization of heterogeneous wireless devices.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords