IEEE Access (Jan 2022)

Emotion Processing by Applying a Fuzzy-Based Vader Lexicon and a Parallel Deep Belief Network Over Massive Data

  • Fatima Es-Sabery,
  • Ibrahim Es-Sabery,
  • Abdellatif Hair,
  • Beatriz Sainz-De-Abajo,
  • Begonya Garcia-Zapirain

DOI
https://doi.org/10.1109/ACCESS.2022.3200389
Journal volume & issue
Vol. 10
pp. 87870 – 87899

Abstract


Emotion processing has been an intensely investigated domain in data analysis and NLP over the past few years. Deep neural network algorithms are currently applied to opinion mining tasks with good results. Among the various neural models applied to opinion mining, the deep belief network (DBN) has attracted particular attention. In this work, we develop a combined classifier for emotion analysis based on a fuzzy VADER lexicon and a parallel deep belief network. We implement multiple preprocessing techniques to improve the quality and soundness of the data and to eliminate noisy data. We then perform semi-automatic dataset labeling by combining two methods: Mamdani's fuzzy inference system and the VADER lexicon. We also apply four feature extractors, GloVe, TF-IDF (trigram), TF-IDF (bigram), and TF-IDF (unigram), to transform each incoming tweet into a numerical feature vector. In addition, we integrate three feature selectors, namely the ANOVA method, the chi-square approach, and the mutual information technique, to select the most relevant features. Further, we implement a DBN classifier that assigns each input tweet to one of three categories: neutral, positive, or negative. Finally, we deploy the proposed approach in parallel using both the Hadoop and Spark frameworks to overcome the long runtimes associated with massive data. We also compare the proposed hybrid approach with alternative hybrid models from the literature. The experimental findings show that the proposed fuzzy parallel approach outperforms the baseline models in terms of false negative rate (1.33%), recall (99.75%), runtime (32.95 s), convergence, stability, F1 score (99.53%), accuracy (98.96%), error rate (1.04%), kappa statistic (99.1%), complexity, false positive rate (0.25%), precision (97.59%), and specificity (98.67%). In conclusion, our fuzzy parallel approach outperforms baseline and deep learning models, as well as several other approaches selected from the literature.
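
To make the pipeline concrete, the sketch below illustrates the labeling and feature steps described in the abstract using off-the-shelf Python libraries (vaderSentiment and scikit-learn). It is a simplified stand-in, not the authors' implementation: plain VADER compound-score thresholds replace the Mamdani fuzzy inference system, only the TF-IDF (unigram) extractor and the chi-square selector are shown, and the DBN classifier and the Hadoop/Spark deployment are omitted; all thresholds and parameter values are illustrative assumptions.

    # Simplified sketch of the labeling, feature extraction, and feature
    # selection steps described in the abstract (assumed libraries and
    # thresholds; not the authors' code).
    from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.feature_selection import SelectKBest, chi2

    tweets = [
        "I love this phone, the battery lasts forever!",
        "Worst service I have ever experienced.",
        "The package arrived on Tuesday.",
    ]

    # 1) Semi-automatic labeling: map the VADER compound score to three
    #    classes (the paper refines this step with a Mamdani fuzzy system).
    analyzer = SentimentIntensityAnalyzer()

    def vader_label(text, pos_th=0.05, neg_th=-0.05):
        compound = analyzer.polarity_scores(text)["compound"]
        if compound >= pos_th:
            return "positive"
        if compound <= neg_th:
            return "negative"
        return "neutral"

    labels = [vader_label(t) for t in tweets]

    # 2) Feature extraction: TF-IDF over unigrams (the paper also uses
    #    bigrams, trigrams, and GloVe embeddings).
    X = TfidfVectorizer(ngram_range=(1, 1)).fit_transform(tweets)

    # 3) Feature selection: keep the k features most associated with the
    #    labels via the chi-square test (ANOVA and mutual information are
    #    the alternatives named in the abstract).
    X_selected = SelectKBest(chi2, k=min(5, X.shape[1])).fit_transform(X, labels)

    print(labels, X_selected.shape)

In the paper itself, the selected feature vectors feed a DBN classifier, and the whole pipeline is parallelized over Hadoop/Spark; those stages are outside the scope of this sketch.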

Keywords