IEEE Access (Jan 2024)

The Impact of the Weighted Features on the Accuracy of X-Platform’s User Credibility Detection Using Supervised Machine Learning

  • Nahid R. Abid-Althaqafi,
  • Hessah A. Alsalamah,
  • Walaa N. Ismail

DOI
https://doi.org/10.1109/ACCESS.2024.3353312
Journal volume & issue
Vol. 12
pp. 8471 – 8484

Abstract

Read online

Social media represent a vital actor in our lives, often serving as a primary source of information, surpassing traditional sources. Among these platforms, the X-Platform, which used to be called Twitter, has emerged as a leading space for the exchange of opinions and emotions. In this study, we introduced a supervised machine learning system designed to detect user credibility in this influential platform. User credibility detection depends largely on the features of the users on the platform. Feature weighting plays a pivotal role in identifying the significance of each feature in a dataset. It can indicate irrelevant features, which can lead to better performance in classification problems. This study aims to highlight the impact of weighted features on the accuracy of X-Platform User Credibility Detection (XUCD) using supervised machine learning methods, such as Principal Component Analysis (PCA) and correlation-coefficient algorithms, and tree-based methods, such as (ExtraTressClarifier) to extract new weighted features in the dataset and then use them to train our model to discover their impact on the accuracy of user credibility detection issues. As a result, we measured the effectiveness of different feature-weighting methods on different dataset categories to determine which obtained the best detection accuracy. Experiments were conducted on real user profiles, and statistical and emotional information was extracted from a publicly available dataset called (ArPFN). The improvement in XUCD accuracy using different weighting methods was dependent on the method and dataset category used.

Keywords