Toward Highly-Efficient and Accurate Services QoS Prediction via Machine Unlearning

Yuxiang Zeng; Jianlong Xu; Yuhui Li; Caiyi Chen; Qingcao Dai; Zibo Du

doi:10.1109/ACCESS.2023.3291410

IEEE Access (Jan 2023)

Toward Highly-Efficient and Accurate Services QoS Prediction via Machine Unlearning

Yuxiang Zeng,
Jianlong Xu,
Yuhui Li,
Caiyi Chen,
Qingcao Dai,
Zibo Du

Affiliations

Yuxiang Zeng: ORCiD; College of Engineering, Shantou University, Shantou, China
Jianlong Xu: ORCiD; College of Engineering, Shantou University, Shantou, China
Yuhui Li: College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
Caiyi Chen: College of Engineering, Shantou University, Shantou, China
Qingcao Dai: College of Engineering, Shantou University, Shantou, China
Zibo Du: College of Engineering, Shantou University, Shantou, China

DOI: https://doi.org/10.1109/ACCESS.2023.3291410
Journal volume & issue: Vol. 11
pp. 76242 – 76254

Abstract

Read online

Personalized Internet of Things (IoT) services prediction based on Quality-of-Service (QoS) is an indispensable technique for selecting appropriate services for each user. However, existing collaborative prediction models do not take into account the user’s authority to manage their own generated data. From the standpoint of users, the expectation is for models to eliminate the impact of their sensitive data to the greatest extent possible. Meanwhile, IoT service providers face the challenge of data contamination during service provision, which necessitates models to forget data quickly and accurately to restore performance. Furthermore, existing QoS prediction methods usually suffer from low model availability when handling unlearning requests by full retraining. This underscores the need to address security, availability, fidelity, privacy, and related issues, highlighting the urgency of unlearning. To solve the problem, we propose Context-Aware Data Driven Eraser (CADDEraser), a novel efficient machine unlearning framework for QoS prediction tasks. Firstly, we divide the training data into multiple shards to train submodels and obtain node embeddings by utilizing contextual information to derive graph embeddings. Then these embeddings are employed in a balanced clustering partition, ensuring the preservation of the QoS record between users and services. Finally, we use a concatenate aggregation method and stacking & attention-based aggregation methods to synthesize information from sub-models more efficiently. Experiments on large-scale datasets show that our CADDEraser framework not only improves efficiency but also enhances the accuracy of QoS prediction, achieving efficient unlearning and outperforms state-of-the-art unlearning approaches. Source codes are available at https://github.com/ZengYuXiang7/CADDEraser.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords