Spam detection for Youtube video comments using machine learning approaches

Andrew S. Xiao; Qilian Liang

Machine Learning with Applications (Jun 2024)

Spam detection for Youtube video comments using machine learning approaches

Andrew S. Xiao,
Qilian Liang

Affiliations

Andrew S. Xiao: Purdue University, Department of Computer Science, West Lafayette, 47907, IN, USA
Qilian Liang: University of Texas at Arlington, Department of Electrical Engineering, Arlington, 76019, TX, USA; Corresponding author.

Journal volume & issue: Vol. 16
p. 100550

Abstract

Read online

Machine Learning models have the ability to streamline the process by which Youtube video comments are filtered between legitimate comments (ham) and spam. In order to integrate machine learning models into regular usage on media-sharing platforms, recent approaches have aimed to develop models trained on Youtube comments, which have emerged as valuable tools for the classification and have enabled the identification of spam content and enhancing user experience. In this paper, eight machine learning approaches are applied to spam detection for YouTube comments. The eight machine learning models include Gaussian Naive Bayes, logistic regression, K-nearest neighbors (KNN) classifier, multi-layer perceptron (MLP), support vector machine (SVM) classifier, random forest classifier, decision tree classifier, and voting classifier. All eight models perform very well, specifically random forest approach can achieve almost perfect performance with average precision of 100% and AUC-ROC of 0.9841. The computational complexity of the eight machine learning approaches are compared.

Published in Machine Learning with Applications

ISSN: 2666-8270 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Science: Science (General): Cybernetics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.journals.elsevier.com/machine-learning-with-applications

About the journal

Abstract

Keywords