Multi-View Low-Rank Coding-Based Network Data De-Anonymization

Xingping Xian; Tao Wu; Shaojie Qiao; Wei Wang; Yanbing Liu; Nan Han

doi:10.1109/ACCESS.2020.2995568

IEEE Access (Jan 2020)

Multi-View Low-Rank Coding-Based Network Data De-Anonymization

Xingping Xian,
Tao Wu,
Shaojie Qiao,
Wei Wang,
Yanbing Liu,
Nan Han

Affiliations

Xingping Xian: Department of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, China
Tao Wu: ORCiD; School of Cyber Security and Information Law, Chongqing University of Posts and Telecommunications, Chongqing, China
Shaojie Qiao: School of Software Engineering, Chengdu University of Information Technology, Chengdu, China
Wei Wang: ORCiD; Institute of Cybersecurity, Sichuan University, Chengdu, China
Yanbing Liu: Chongqing Engineering Laboratory of Internet and Information Security, Chongqing University of Posts and Telecommunications, Chongqing, China
Nan Han: School of Management, Chengdu University of Information Technology, Chengdu, China

DOI: https://doi.org/10.1109/ACCESS.2020.2995568
Journal volume & issue: Vol. 8
pp. 94575 – 94593

Abstract

Read online

Social networks are extensively exploited by third-party consumers such as researchers and advertisers to understand user characteristics and behaviors. In general, before network data is published, sensitive relationships should be anonymized to prevent the compromise of individual privacy. To quantify the guarantee level of privacy-preserving mechanisms and mitigate users' privacy concerns, numerous studies concerning network data de-anonymization have been carried out. However, most existing studies focus on single-view data, and privacy protection for multi-view data that is ubiquitous in the era of big data has not been yet extensively explored. In this study, we are interested in answering the following question: Are the traditional privacy protection methods still valid for the anonymization of multi-view data? In this study, we propose a Multi-View Low-Rank Coding (MVLRC) based network data de-anonymization framework to assess the vulnerability of privacy protection techniques by accurately reconstructing a large portion of the original data. Specifically, the framework assumes that in principle, the target and auxiliary networks have common structural patterns, and they can be modeled together to infer the hidden structure of the target network. The essential components of our work include the following: (1) a robust network representation model for structural pattern learning; (2) the network representation based multi-view modeling of target network and auxiliary network; (3) the inference of the anonymized links via target network reconstruction. Experimental results on synthetic networks and three real-world networks demonstrate that auxiliary networks can be utilized by malicious adversaries for privacy inference attacks. Thus, the privacy protection of multi-view network data needs more sophisticated anonymization techniques.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords