Data in Brief (Apr 2024)

A comprehensive dataset for sentiment and emotion classification from Bangladesh e-commerce reviews

  • Mohammad Rifat Ahmmad Rashid,
  • Kazi Ferdous Hasan,
  • Rakibul Hasan,
  • Aritra Das,
  • Mithila Sultana,
  • Mahamudul Hasan

Journal volume & issue
Vol. 53
p. 110052

Abstract

Read online

In the rapidly evolving domain of e-commerce, analyzing customer feedback through reviews is crucial, particularly for understanding and enhancing consumer experience in the Bangladeshi market. Our comprehensive dataset, derived from two Bangladeshi e-commerce platforms, Daraz and Pickaboo, features a diverse collection of reviews in both Bengali and English, covering a broad range of products. These reviews are not only rich in linguistic variety but also encapsulate a spectrum of emotions, some even conveyed through emojis, offering a deep dive into consumer sentiment. Expert annotators have meticulously examined and categorized each review, classifying emotions into five distinct types - Happiness, Sadness, Fear, Anger, and Love - and sentiments into Positive (Happiness, Love) and Negative (Sadness, Anger, Fear) categories. This level of detailed annotation enables precise assessments of customer emotions and preferences, which are essential for evaluating and improving existing product offerings. Moreover, the insights gleaned from this dataset are invaluable for guiding future product development and uncovering new opportunities in the dynamic Bangladeshi market. Ultimately, this dataset not only serves as a significant resource for sentiment analysis using natural language processing (NLP) techniques but also contributes valuable insights into the unique consumer behavior patterns in Bangladesh, enriching the NLP community's understanding of diverse market dynamics.

Keywords