European Journal of Business Science and Technology (Dec 2015)

Automated Extraction of Typical Expressions Describing Product Features from Customer Reviews

  • Karel Barák,
  • František Dařena,
  • Jan Žižka

DOI
https://doi.org/10.11118/ejobsat.v1i2.27
Journal volume & issue
Vol. 1, no. 2
pp. 83 – 92

Abstract

Read online

The paper presents a procedure that helps in revealing topics hidden in large collections of textual documents (such as customer reviews) related to a certain group of products or services. Together with identification of the groups containing the topics the lists of important expressions is presented which helps in understanding what characterizes these aspects most typically from the semantic point of view. The procedure includes determining an appropriate number of groups representing the prevailing topics, partitioning the documents into a desired number of groups using clustering, extracting significant typical features of documents from each group with application of feature selection methods, and evaluating the outcomes with the assistance of a human expert. The results show that the presented approach, consisting mostly of automated steps, is able to separate and characterize the aspects of a certain product as discussed by the customers and be later useful, e.g., for handling customer complaints, designing promotional campaigns, or improving the products.

Keywords