Re-Engineered Word Embeddings for Improved Document-Level Sentiment Analysis

Su Yang; Farzin Deravi

doi:10.3390/app12189287

Applied Sciences (Sep 2022)

Re-Engineered Word Embeddings for Improved Document-Level Sentiment Analysis

Su Yang,
Farzin Deravi

Affiliations

Su Yang: Department of Computer Science, Faculty of Science and Engineering, Swansea University, Swansea SA1 8EN, UK
Farzin Deravi: School of Engineering, University of Kent, Canterbury CT2 7NZ, UK

DOI: https://doi.org/10.3390/app12189287
Journal volume & issue: Vol. 12, no. 18
p. 9287

Abstract

Read online

In this paper, a novel re-engineering mechanism for the generation of word embeddings is proposed for document-level sentiment analysis. Current approaches to sentiment analysis often integrate feature engineering with classification, without optimizing the feature vectors explicitly. Engineering feature vectors to match the data between the training set and query sample as proposed in this paper could be a promising way for boosting the classification performance in machine learning applications. The proposed mechanism is designed to re-engineer the feature components from a set of embedding vectors for greatly increased between-class separation, hence better leveraging the informative content of the documents. The proposed mechanism was evaluated using four public benchmarking datasets for both two-way and five-way semantic classifications. The resulting embeddings have demonstrated substantially improved performance for a range of sentiment analysis tasks. Tests using all the four datasets achieved by far the best classification results compared with the state-of-the-art.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords