The Effects of Weight Quantization on Online Federated Learning for the IoT: A Case Study

Nil Llisterri Gimenez; Junkyu Lee; Felix Freitag; Hans Vandierendonck

doi:10.1109/ACCESS.2024.3349557

IEEE Access (Jan 2024)

Affiliations

Nil Llisterri Gimenez: Department of Computer Architecture, Technical University of Catalonia, Barcelona, Spain
Junkyu Lee: Institute for Analytics and Data Science, University of Essex, Colchester, U.K.
Felix Freitag: Department of Computer Architecture, Technical University of Catalonia, Barcelona, Spain
Hans Vandierendonck: Institute of Electronics, Communications and Information Technology, Queen’s University Belfast, Belfast, U.K.

DOI: https://doi.org/10.1109/ACCESS.2024.3349557
Journal volume & issue: Vol. 12, pp. 5490–5502

Abstract

Many weight quantization approaches have been explored to save communication bandwidth between clients and the server in federated learning on high-end computing machines. However, there is a lack of weight quantization research for online federated learning on TinyML devices, which are restricted in mini-batch size, neural network size, and communication method by their severe hardware resource constraints and power budgets. We use the name Tiny Online Federated Learning (TinyOFL) for online federated learning on TinyML devices in the Internet of Things (IoT). This paper performs a comprehensive analysis of the effects of weight quantization in TinyOFL in terms of accuracy, stability, overfitting, communication efficiency, energy consumption, and delivery time, and extracts practical guidelines on how to apply weight quantization to TinyOFL. Our analysis is supported by a TinyOFL case study with three Arduino Portenta H7 boards running federated learning clients for a keyword spotting task. Our findings include that TinyOFL tolerates more aggressive weight quantization than online learning without FL, without affecting accuracy, thanks to TinyOFL’s quasi-batch training property. For example, 7-bit weights achieved accuracy equivalent to 32-bit floating-point weights while reducing communication bandwidth by $4.6\times$. Overfitting caused by increasing network width rarely occurs in TinyOFL, but may occur if strong weight quantization is applied. The experiments also showed that there is a design space for TinyOFL applications in which the accuracy loss due to weight quantization is compensated by an increase in neural network size.
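
To make the bandwidth figure concrete, the following is a minimal sketch of uniform (affine) weight quantization, assuming a simple min/max scheme. The abstract does not specify the quantizer used in the paper, so the function names `quantize`, `dequantize`, and `compression_ratio`, and the scheme itself, are illustrative assumptions rather than the authors' method. The sketch shows that sending 7-bit codes instead of 32-bit floats gives a ratio of 32/7, roughly 4.57, which is consistent with the reported $4.6\times$ saving when the small overhead of transmitting the offset and scale is ignored.

```python
import random


def quantize(weights, bits):
    """Map float weights to integer codes in [0, 2**bits - 1] (assumed min/max scheme)."""
    levels = (1 << bits) - 1
    lo, hi = min(weights), max(weights)
    # Guard against a constant weight vector, where hi == lo.
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((w - lo) / scale) for w in weights]
    return codes, lo, scale


def dequantize(codes, lo, scale):
    """Reconstruct approximate float weights from integer codes."""
    return [lo + c * scale for c in codes]


def compression_ratio(bits, full_bits=32):
    """Payload size ratio, ignoring the cost of sending `lo` and `scale`."""
    return full_bits / bits


random.seed(0)
# Hypothetical weights standing in for one layer of a client model.
weights = [random.uniform(-1.0, 1.0) for _ in range(1000)]

codes, lo, scale = quantize(weights, bits=7)
restored = dequantize(codes, lo, scale)
max_error = max(abs(w - r) for w, r in zip(weights, restored))

print(f"compression ratio: {compression_ratio(7):.2f}")
print(f"max reconstruction error: {max_error:.5f} (half step: {scale / 2:.5f})")
```

With rounding to the nearest level, the reconstruction error of each weight is bounded by half a quantization step, so fewer bits trade a coarser step for a smaller payload.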

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
