A Novel Stacked Ensemble for Hate Speech Recognition

Mona Khalifa A. Aljero; Nazife Dimililer

doi:10.3390/app112411684

Applied Sciences (Dec 2021)

A Novel Stacked Ensemble for Hate Speech Recognition

Mona Khalifa A. Aljero,
Nazife Dimililer

Affiliations

Mona Khalifa A. Aljero: Department of Applied Mathematics & Computer Sciences, Faculty of Arts and Sciences, Eastern Mediterranean University, Via Mersin 10, Famagusta 99628, North Cyprus, Turkey
Nazife Dimililer: Department of Information Technology, School of Computing and Technology, Eastern Mediterranean University, Via Mersin 10, Famagusta 99628, North Cyprus, Turkey

DOI: https://doi.org/10.3390/app112411684
Journal volume & issue: Vol. 11, no. 24
p. 11684

Abstract

Read online

Detecting harmful content or hate speech on social media is a significant challenge due to the high throughput and large volume of content production on these platforms. Identifying hate speech in a timely manner is crucial in preventing its dissemination. We propose a novel stacked ensemble approach for detecting hate speech in English tweets. The proposed architecture employs an ensemble of three classifiers, namely support vector machine (SVM), logistic regression (LR), and XGBoost classifier (XGB), trained using word2vec and universal encoding features. The meta classifier, LR, combines the outputs of the three base classifiers and the features employed by the base classifiers to produce the final output. It is shown that the proposed architecture improves the performance of the widely used single classifiers as well as the standard stacking and classifier ensemble using majority voting. We also present results on the use of various combinations of machine learning classifiers as base classifiers. The experimental results from the proposed architecture indicated an improvement in the performance on all four datasets compared with the standard stacking, base classifiers, and majority voting. Furthermore, on three of these datasets, the proposed architecture outperformed all state-of-the-art systems.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords