Deep Random Forest and AraBert for Hate Speech Detection from Arabic Tweets

Kheir Eddine Daouadi; Yaakoub Boualleg; Oussama Guehairia

doi:10.3897/jucs.112604

Journal of Universal Computer Science (Nov 2023)

Deep Random Forest and AraBert for Hate Speech Detection from Arabic Tweets

Kheir Eddine Daouadi,
Yaakoub Boualleg,
Oussama Guehairia

Affiliations

Kheir Eddine Daouadi: Echahid Cheikh Larbi Tebessi University
Yaakoub Boualleg: Echahid Cheikh Larbi Tebessi University
Oussama Guehairia: Mohamed Khider University of Biskra

DOI: https://doi.org/10.3897/jucs.112604
Journal volume & issue: Vol. 29, no. 11
pp. 1319 – 1335

Abstract

Read online Read online Read online

Nowadays, hate speech detection from Arabic tweets attracts the attention of many researchers. Numerous systems and techniques have been proposed to address this classification challenge. Nonetheless, three major limits persist: the use of deep learning models with an excess of hyperparameters, the reliance on hand-crafted features, and the requirement for a huge amount of training data to achieve satisfactory performance. In this study, we propose Contextual Deep Random Forest (CDRF), a hate speech detection approach that combines contextual embedding and Deep Random Forest. From the experimental findings, the Arabic contextual embedding model proves to be highly effective in hate speech detection, outperforming the static embedding models. Additionally, we prove that the proposed CDRF significantly enhances the performance of Arabic hate speech classification.

Published in Journal of Universal Computer Science

ISSN: 0948-695X (Print); 0948-6968 (Online)
Publisher: Graz University of Technology
Country of publisher: Austria
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://lib.jucs.org/

About the journal

Abstract

Keywords