Sarcasm Detection in Indonesian-English Code-Mixed Text Using Multihead Attention-Based Convolutional and Bi-Directional GRU

Mochamad Alfan Rosid; Daniel Oranova Siahaan; Ahmad Saikhu

doi:10.1109/ACCESS.2024.3436107

IEEE Access (Jan 2024)

Sarcasm Detection in Indonesian-English Code-Mixed Text Using Multihead Attention-Based Convolutional and Bi-Directional GRU

Mochamad Alfan Rosid,
Daniel Oranova Siahaan,
Ahmad Saikhu

Affiliations

Mochamad Alfan Rosid: ORCiD; Department of Informatics, Faculty of Intelligent Electrical and Informatics Technology, Institut Teknologi Sepuluh Nopember (ITS), Sukolilo, Surabaya, Indonesia
Daniel Oranova Siahaan: ORCiD; Department of Informatics, Faculty of Intelligent Electrical and Informatics Technology, Institut Teknologi Sepuluh Nopember (ITS), Sukolilo, Surabaya, Indonesia
Ahmad Saikhu: Department of Informatics, Faculty of Intelligent Electrical and Informatics Technology, Institut Teknologi Sepuluh Nopember (ITS), Sukolilo, Surabaya, Indonesia

DOI: https://doi.org/10.1109/ACCESS.2024.3436107
Journal volume & issue: Vol. 12
pp. 137063 – 137079

Abstract

Read online

Detecting sarcasm in text is a very challenging task. Sarcasm often depends on context, tone, and cultural references, which can be difficult for machines to understand. In addition, the increasing occurrence of code-mixing in social media posts poses new challenges in sarcasm detection. Research on sarcasm detection in mixed-code text written in languages other than English is still limited owing to the unavailability of public datasets. To overcome this issue, a dataset was built for sarcasm detection in Indonesian-English mixed-code texts. Furthermore, a hybrid model based on a convolutional neural network (CNN) with multi-head attention and a bi-directional gated recurrent unit (BiGRU), named MHA-CovBi, is proposed for sarcasm detection. In the proposed MHA-CovBi model, a combination of FastText and GloVe word embeddings is utilized to assist the model in understanding and processing texts in different languages. GloVe pretrained word embedding is used for vector representation of English words, while FastText pretrained word embedding is used for vector representation of Indonesian words. Moreover, an auxiliary pragmatic feature illustrating the number of pragmatic markers in tweets was incorporated to enhance detection performance. In addition, this study presents a language detection scheme and transliteration process that can be used to handle languages other than Indonesian and English using Google Translate API. The performance of the proposed model was evaluated through comparative analysis against existing approaches. The proposed model successfully outperformed current state-of-the-art models, achieving an accuracy of 94.60% and F1 score of 94.38%.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords