Social Media Hate Speech Detection Using Explainable Artificial Intelligence (XAI)

Harshkumar Mehta; Kalpdrum Passi

doi:10.3390/a15080291

Algorithms (Aug 2022)

Social Media Hate Speech Detection Using Explainable Artificial Intelligence (XAI)

Harshkumar Mehta,
Kalpdrum Passi

Affiliations

Harshkumar Mehta: School of Engineering and Computer Science, Laurentian University, Sudbury, ON P3E 2C6, Canada
Kalpdrum Passi: School of Engineering and Computer Science, Laurentian University, Sudbury, ON P3E 2C6, Canada

DOI: https://doi.org/10.3390/a15080291
Journal volume & issue: Vol. 15, no. 8
p. 291

Abstract

Read online

Explainable artificial intelligence (XAI) characteristics have flexible and multifaceted potential in hate speech detection by deep learning models. Interpreting and explaining decisions made by complex artificial intelligence (AI) models to understand the decision-making process of these model were the aims of this research. As a part of this research study, two datasets were taken to demonstrate hate speech detection using XAI. Data preprocessing was performed to clean data of any inconsistencies, clean the text of the tweets, tokenize and lemmatize the text, etc. Categorical variables were also simplified in order to generate a clean dataset for training purposes. Exploratory data analysis was performed on the datasets to uncover various patterns and insights. Various pre-existing models were applied to the Google Jigsaw dataset such as decision trees, k-nearest neighbors, multinomial naïve Bayes, random forest, logistic regression, and long short-term memory (LSTM), among which LSTM achieved an accuracy of 97.6%. Explainable methods such as LIME (local interpretable model—agnostic explanations) were applied to the HateXplain dataset. Variants of BERT (bidirectional encoder representations from transformers) model such as BERT + ANN (artificial neural network) with an accuracy of 93.55% and BERT + MLP (multilayer perceptron) with an accuracy of 93.67% were created to achieve a good performance in terms of explainability using the ERASER (evaluating rationales and simple English reasoning) benchmark.

Published in Algorithms

ISSN: 1999-4893 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/algorithms

About the journal

Abstract

Keywords