Design and Implementation of Fast Spoken Foul Language Recognition with Different End-to-End Deep Neural Network Architectures

Abdulaziz Saleh Ba Wazir; Hezerul Abdul Karim; Mohd Haris Lye Abdullah; Nouar AlDahoul; Sarina Mansor; Mohammad Faizal Ahmad Fauzi; John See; Ahmad Syazwan Naim

doi:10.3390/s21030710

Sensors (Jan 2021)

Affiliations

Abdulaziz Saleh Ba Wazir: Faculty of Engineering, Multimedia University, Cyberjaya 63100, Malaysia
Hezerul Abdul Karim: Faculty of Engineering, Multimedia University, Cyberjaya 63100, Malaysia
Mohd Haris Lye Abdullah: Faculty of Engineering, Multimedia University, Cyberjaya 63100, Malaysia
Nouar AlDahoul: Faculty of Engineering, Multimedia University, Cyberjaya 63100, Malaysia
Sarina Mansor: Faculty of Engineering, Multimedia University, Cyberjaya 63100, Malaysia
Mohammad Faizal Ahmad Fauzi: Faculty of Engineering, Multimedia University, Cyberjaya 63100, Malaysia
John See: Faculty of Computing and Informatics, Multimedia University, Cyberjaya 63100, Malaysia
Ahmad Syazwan Naim: IPTV Development, Unifi Content, Telekom Malaysia Berhad, Cyberjaya 63100, Malaysia

DOI: https://doi.org/10.3390/s21030710
Journal volume & issue: Vol. 21, no. 3
p. 710

Abstract

Given the prevalence of foul language in audio and video files and its detrimental effects on an individual’s character and behaviour, content censorship is crucial to shield young viewers, who have higher exposure to uncensored content, from profanities. Manual detection and censorship have been implemented, but these methods proved tedious, and foul language was inevitably misidentified owing to human weariness and the reduced performance of the human visual system over long screening times. As such, this paper proposed an intelligent system for foul language censorship based on an automated and robust detection method using advanced deep Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) cells. Foul language data were collected, annotated, augmented, and analysed for the development and evaluation of both CNN and RNN configurations. The results indicated the feasibility of the proposed systems, which identified a high proportion of curse words with a False Negative Rate (FNR) of only 2.53% to 5.92%. The proposed system outperformed state-of-the-art pre-trained neural networks on the novel foul language dataset and reduced computational cost with a minimal number of trainable parameters.
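
For readers unfamiliar with the metric, the False Negative Rate quoted above is conventionally defined as FN / (FN + TP): the share of actual foul-language samples that the detector misses. The following is a minimal sketch of that standard definition only; the function name and the counts are hypothetical and are not taken from the paper's dataset or code.

```python
def false_negative_rate(true_positives: int, false_negatives: int) -> float:
    """Return FN / (FN + TP), the fraction of positive samples that were missed."""
    total_positives = true_positives + false_negatives
    if total_positives == 0:
        # With no positive samples the ratio is undefined.
        raise ValueError("FNR is undefined when there are no positive samples")
    return false_negatives / total_positives


# Hypothetical counts for illustration (not results from the paper):
# 950 foul-language clips detected, 50 missed.
fnr = false_negative_rate(true_positives=950, false_negatives=50)
print(f"FNR = {fnr:.2%}")  # prints "FNR = 5.00%"
```

A lower FNR means fewer curse words slip past the filter, which is why it is a natural headline metric for a censorship system.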

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors
