Adaptive Similarity Function with Structural Features of Network Embedding for Missing Link Prediction

Chuanting Zhang; Ke-Ke Shang; Jingping Qiao

doi:10.1155/2021/1277579

Complexity (Jan 2021)

Affiliations

Chuanting Zhang: Computer, Electrical and Mathematical Science Engineering Division
Ke-Ke Shang: Computational Communication Collaboratory
Jingping Qiao: School of Information Science and Engineering

DOI: https://doi.org/10.1155/2021/1277579
Journal volume & issue: Vol. 2021

Abstract

Link prediction is a fundamental problem in data science, and it usually calls for uncovering the mechanisms that govern the micro-dynamics of networks. In this regard, using features obtained from network embedding to predict links has drawn widespread attention. Although methods based on edge features or node similarity have been proposed for link prediction, many technical challenges remain because of the unique structural properties of networks, especially when the networks are sparse. From the graph mining perspective, we first give empirical evidence of the inconsistency between heuristic and learned edge features. We then propose a novel link prediction framework, AdaSim, which introduces an Adaptive Similarity function over features obtained from random-walk-based network embedding. The node feature representations are obtained by optimizing a graph-based objective function. Instead of generating edge features with binary operators, we perform link prediction using only the node features of the network. We define a flexible similarity function with one tunable parameter, which serves as a penalty on the original similarity measure. The optimal value of this parameter is learned through supervised learning and is thus adaptive to the data distribution. To evaluate the proposed algorithm, we conduct extensive experiments on eleven disparate real-world networks. Experimental results show that AdaSim achieves better performance than state-of-the-art algorithms and is robust to different levels of network sparsity.
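
The abstract does not state the form of the similarity function, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes the "original similarity measure" is the dot product of two node embeddings and that the tunable parameter `p` is an exponent on the product of their norms (so `p = 0` gives the raw dot product and `p = 1` gives cosine similarity). The function names, the grid search, and the use of AUC as the supervised criterion are all hypothetical choices made for illustration; the embeddings are taken as given rather than learned from random walks.

```python
import numpy as np

def adaptive_similarity(emb, pairs, p):
    """Dot product of node embeddings, penalized by the norm product raised to p.

    Assumed form (not taken from the paper): p = 0 is the raw dot product,
    p = 1 is cosine similarity.
    """
    u = emb[pairs[:, 0]]
    v = emb[pairs[:, 1]]
    dot = np.sum(u * v, axis=1)
    norms = np.linalg.norm(u, axis=1) * np.linalg.norm(v, axis=1)
    # Small constant guards against division by zero for all-zero embeddings.
    return dot / np.power(norms + 1e-12, p)

def auc(scores, labels):
    """Probability that a random positive pair outscores a random negative pair."""
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    diff = pos[:, None] - neg[None, :]
    # Ties count as half a win.
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))

def fit_penalty(emb, pairs, labels, grid=np.linspace(0.0, 1.0, 11)):
    """Pick the penalty p that maximizes AUC on labeled node pairs (grid search)."""
    aucs = [auc(adaptive_similarity(emb, pairs, p), labels) for p in grid]
    best = int(np.argmax(aucs))
    return float(grid[best]), aucs[best]

# Toy usage: random vectors stand in for learned node embeddings.
rng = np.random.default_rng(0)
toy_emb = rng.normal(size=(6, 4))
toy_pairs = np.array([[0, 1], [2, 3], [4, 5], [0, 5]])
toy_labels = np.array([1, 1, 0, 0])  # 1 = link exists, 0 = no link
best_p, best_auc = fit_penalty(toy_emb, toy_pairs, toy_labels)
```

In this sketch the parameter is fit on labeled pairs, so its value depends on the data, which is the sense in which the abstract calls the similarity adaptive. Any other supervised criterion or optimizer could replace the grid search.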

Published in Complexity

ISSN: 1076-2787 (Print); 1099-0526 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://onlinelibrary.wiley.com/journal/8503
