Hadoop framework for efficient sentiment classification using trees

K. Sridharan; G. Komarasamy; S. Daniel Madan Raja

doi:10.1049/iet-net.2019.0208

IET Networks (Sep 2020)

Hadoop framework for efficient sentiment classification using trees

K. Sridharan,
G. Komarasamy,
S. Daniel Madan Raja

Affiliations

K. Sridharan: Anna UniversityChennaiIndia
G. Komarasamy: Department of Computer Science and EngineeringJain UniversityBangaloreIndia
S. Daniel Madan Raja: Department of Information TechnologyBannari Amman Institute of TechnologySathyamangalamIndia

DOI: https://doi.org/10.1049/iet-net.2019.0208
Journal volume & issue: Vol. 9, no. 5
pp. 223 – 228

Abstract

Read online

Due to the increase in the speed of generation of data, the authors are forced to handle a massive volume of data with the help of conventional machine learning algorithms. Big data is an enormous volume of data which is beyond the capacity of the traditional database software tool to collect, store, manage, and process within a stipulated time limit. Sentiment analysis is analysing the data by classifying the text on the basis of strength and polarity of opinion (positive/negative) words that define the text. While handling big data, Hadoop provides a platform for users to develop their own sentiment analysis with the help of a lexicon dictionary or available application programming interface (API) or external programs. The aim of classifying data is to analyse extensive data and develop an appropriate description or model for every organised class with the feature present in the data. In this work, the feature extraction based on term frequency‐inverse document frequency is utilised and the Hadoop framework in attaining a useful classification with the help of random forest techniques.

Published in IET Networks

ISSN: 2047-4954 (Print); 2047-4962 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Telecommunication
Website: https://ietresearch.onlinelibrary.wiley.com/journal/20474962

About the journal

Abstract

Keywords