UHD Journal of Science and Technology (Sep 2023)

Link Prediction in Dynamic Networks Based on the Selection of Similarity Criteria and Machine Learning

  • Karwan Mohammed HamaKarim

DOI
https://doi.org/10.21928/uhdjst.v7n2y2023.pp32-39
Journal volume & issue
Vol. 7, no. 2
pp. 32 – 39

Abstract

Read online

The study’s findings showed that link prediction utilizing the similarity learning model in dynamic networks (LSDN) performed better than other learning techniques including neural network learning and decision tree learning in terms of the three criteria of accuracy, coverage, and efficiency., Compared to the random forest approach, the LSDN learning algorithm’s link prediction accuracy increased from 97% to 99%. The proposed method’s use of oversampling, which improved link prediction accuracy, was the cause of the improvement in area under the curve (AUC). To bring the ratio of the classes closer together, the suggested strategy attempted to produce more samples from the minority class. In addition, similarity criteria were chosen utilizing feature selection techniques based on correlation that had a strong link with classes. This technique decreased over-fitting and improved the suggested method’s test data generalizability. Based on the three criteria (accuracy, coverage, and efficiency), the research’s findings demonstrated that link prediction utilizing the similarity LSDN outperformed other learning techniques including neural network learning and decision tree learning. Compared to the random forest algorithm, the LSDN algorithm’s link prediction accuracy increased from 97% to 99%. The oversampling in the suggested strategy, which increased link prediction accuracy, is what caused the increase in AUC. To bring the ratio of the classes closer together, the suggested strategy attempted to produce more samples from the minority class. In addition, similarity criteria were chosen utilizing feature selection techniques based on correlation that had a strong link with classes. This technique decreased over-fitting and improved the suggested method’s test data generalizability.

Keywords