Federated selective aggregation for on-device knowledge amalgamation

Donglin Xie; Ruonan Yu; Gongfan Fang; Jiaqi Han; Jie Song; Zunlei Feng; Li Sun; Mingli Song

Chip (Sep 2023)

Affiliations

Donglin Xie: Key Laboratory of Visual Perception, Ministry of Education and Microsoft, Zhejiang University, Hangzhou 310000, China
Ruonan Yu: Key Laboratory of Visual Perception, Ministry of Education and Microsoft, Zhejiang University, Hangzhou 310000, China
Gongfan Fang: Key Laboratory of Visual Perception, Ministry of Education and Microsoft, Zhejiang University, Hangzhou 310000, China
Jiaqi Han: Key Laboratory of Visual Perception, Ministry of Education and Microsoft, Zhejiang University, Hangzhou 310000, China
Jie Song: School of Software Technology, Zhejiang University, Ningbo 315000, China
Zunlei Feng: School of Software Technology, Zhejiang University, Ningbo 315000, China
Li Sun: Ningbo Innovation Center, Zhejiang University, Ningbo 315000, China
Mingli Song (corresponding author): Key Laboratory of Visual Perception, Ministry of Education and Microsoft, Zhejiang University, Hangzhou 310000, China

Journal volume & issue: Vol. 2, no. 3, p. 100053

Abstract

In this work, we explore a new knowledge amalgamation problem, termed Federated Selective Aggregation for on-device knowledge amalgamation (FedSA). FedSA aims to train an on-device student model for a new task with the help of several decentralized teachers whose pre-training tasks and data are different and agnostic. The motivation for this problem setup stems from a recent dilemma in model sharing: owing to privacy, security, or intellectual property concerns, pre-trained models often cannot be shared, and device resources are usually limited. FedSA offers a solution to this dilemma and goes one step further, in that the method can be deployed on low-power, resource-limited devices. To this end, a dedicated strategy is proposed to handle the knowledge amalgamation. Specifically, student training is driven by a novel saliency-based approach that adaptively selects teachers as participants and integrates their representative capabilities into the student. To evaluate the effectiveness of FedSA, experiments were conducted in both single-task and multi-task settings. The results demonstrate that FedSA effectively amalgamates knowledge from decentralized models and achieves performance competitive with centralized baselines.
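
The abstract does not spell out how saliency is computed or how teachers are combined, so the following is only a minimal sketch of the general idea of selective aggregation, not the authors' algorithm. It assumes each teacher already has a scalar saliency score for the current input (the scores, the top-k rule, the softmax weighting, and the function name `select_and_aggregate` are all hypothetical choices made for illustration). The k highest-scoring teachers are selected and their class-probability outputs are blended into a single soft target that a student could be trained against.

```python
import math


def select_and_aggregate(teacher_probs, saliency, k=2):
    """Pick the k teachers with the highest saliency and blend their outputs.

    teacher_probs: list of per-teacher class-probability lists (equal length).
    saliency: one hypothetical relevance score per teacher (higher = more relevant).
    Returns (selected_indices, aggregated_probs).
    """
    if not teacher_probs or len(teacher_probs) != len(saliency):
        raise ValueError("need at least one teacher and one saliency score per teacher")
    k = max(1, min(k, len(saliency)))

    # Rank teachers by saliency (descending) and keep the top k.
    ranked = sorted(range(len(saliency)), key=lambda i: saliency[i], reverse=True)
    selected = sorted(ranked[:k])

    # Softmax over the selected scores yields weights that sum to 1.
    # Subtracting the maximum keeps the exponentials numerically stable.
    top = max(saliency[i] for i in selected)
    exps = [math.exp(saliency[i] - top) for i in selected]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Weighted average of the selected teachers' probability vectors.
    n_classes = len(teacher_probs[0])
    aggregated = [
        sum(w * teacher_probs[i][c] for w, i in zip(weights, selected))
        for c in range(n_classes)
    ]
    return selected, aggregated


if __name__ == "__main__":
    # Three made-up teachers over three classes; teacher 1 has low saliency.
    teachers = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1], [0.3, 0.3, 0.4]]
    chosen, target = select_and_aggregate(teachers, [2.0, 0.1, 2.0], k=2)
    print(chosen, target)
```

In this sketch, unselected teachers contribute nothing to the target. In the decentralized setting the abstract describes, that would also mean they need not take part in that training step, but the actual selection rule and communication pattern are defined in the paper itself.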

Published in Chip

ISSN: 2709-4723 (Print); 2772-2724 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://www.sciencedirect.com/journal/chip
