Heterogeneous Defect Prediction Based on Federated Reinforcement Learning via Gradient Clustering

Aili Wang; Yinghui Zhao; Guodong Li; Jun Zhang; Haibin Wu; Yuji Iwahori

doi:10.1109/ACCESS.2022.3195039

IEEE Access (Jan 2022)

Heterogeneous Defect Prediction Based on Federated Reinforcement Learning via Gradient Clustering

Aili Wang,
Yinghui Zhao,
Guodong Li,
Jun Zhang,
Haibin Wu,
Yuji Iwahori

Affiliations

Aili Wang: ORCiD; Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin, China
Yinghui Zhao: ORCiD; Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin, China
Guodong Li: ORCiD; Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin, China
Jun Zhang: China Energy Taishan Power, Taishan, China
Haibin Wu: ORCiD; Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin, China
Yuji Iwahori: ORCiD; Department of Computer Science, Chubu University, Kasugai, Aichi, Japan

DOI: https://doi.org/10.1109/ACCESS.2022.3195039
Journal volume & issue: Vol. 10
pp. 87832 – 87843

Abstract

Read online

Heterogeneous defect prediction (HDP) refers to using heterogeneous data collected by other projects to build a defect prediction model to predict the software defects in a project. Traditional methods usually involve the measurement of the source project and the target project. However, due to the limitations of laws and regulations, these original data are not easy to obtain, which forms a data island. As a new machine learning paradigm, federated learning (FL) has great advantages in training heterogeneous data and data island. In order to solve the data island and data heterogeneity of HDP, we propose a novel Federated Reinforcement Learning via Gradient Clustering (FRLGC) method in this paper. Firstly, the parameters of the global model are transferred to each dueling deep Q network (dueling DQN) model and each client uses private data to train the dueling model which combines experience replay to increase data efficiency in limited datasets. Secondly, gaussian differential privacy is used to encrypt the model parameters to ensure the privacy and security of the model. Finally, we cluster the clients according to their locally encrypted model parameters and use weighted average to aggregate to create a new global model locally and globally. Experiments on nine projects in three public databases (Relink, NASA and AEEEM) show that FRLGC is superior to the relevant HDP solutions.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords