Alexandria Engineering Journal (Jul 2023)

User identification for knowledge graph construction across multiple online social networks

  • Cuicui Ye,
  • Jing Yang,
  • Yan Mao

Journal volume & issue
Vol. 73
pp. 145 – 158

Abstract

User identification across multiple online social networks is beneficial for building knowledge graphs. Owing to privacy protection considerations, researchers have shown increasing interest in user identification based on username similarity. However, existing solutions rely on manual features extracted by domain experts and do not exploit the deep semantic features of usernames. Moreover, existing solutions are limited to monolingual usernames, such as English or Chinese ones, and ignore usernames in other languages. This paper proposes a username similarity method based on a multilingual pre-trained model for user identification across multiple online social networks. First, we use many multilingual corpora so that the model learns more semantic information and can extract deep semantic features of usernames. Then, the model is fine-tuned on our constructed dataset of multilingual usernames across multiple online social networks. Finally, the fine-tuned model assesses the similarity of user identities across multiple online social networks. Our method facilitates user identification with limited data. The efficiency of our model is verified on three constructed real-world multilingual username datasets across multiple online social networks and compared with existing state-of-the-art methods. Experimental results show that the proposed algorithm outperforms the compared algorithms.
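
The core idea of scoring two usernames by similarity and deciding whether they belong to the same person can be illustrated with a minimal Python sketch. This is not the paper's model: the abstract describes deep semantic features from a fine-tuned multilingual pre-trained model, whereas the sketch swaps in simple character bigram count vectors compared by cosine similarity. The function names (`char_ngrams`, `username_similarity`, `same_user`) and the 0.6 decision threshold are hypothetical choices made for illustration only.

```python
import math
import unicodedata
from collections import Counter


def char_ngrams(username: str, n: int = 2) -> Counter:
    """Count character n-grams of a normalized username."""
    # NFKC folds compatibility forms (e.g. full-width letters);
    # casefold removes case differences across scripts.
    text = unicodedata.normalize("NFKC", username).casefold()
    padded = f"^{text}$"  # boundary markers so prefixes and suffixes count
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def username_similarity(u1: str, u2: str, n: int = 2) -> float:
    """Stand-in similarity score; the paper uses learned embeddings instead."""
    return cosine_similarity(char_ngrams(u1, n), char_ngrams(u2, n))


def same_user(u1: str, u2: str, threshold: float = 0.6) -> bool:
    """Hypothetical decision rule: treat a pair as one user above the threshold."""
    return username_similarity(u1, u2) >= threshold


if __name__ == "__main__":
    for pair in [("alice_w", "alice.w"), ("ＡＬＩＣＥ", "alice"), ("alice", "张伟")]:
        print(pair, round(username_similarity(*pair), 3), same_user(*pair))
```

Surface n-grams only capture spelling overlap, so a pair such as a Chinese name and its romanized form would score near zero here. That gap is the kind of limitation of hand-crafted features that motivates learned multilingual representations.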

Keywords