A Frequent Named Entities-Based Approach for Interpreting Reputation in Twitter

Nacéra Bennacer Seghouani; Francesca Bugiotti; Moditha Hewasinghage; Suela Isaj; Gianluca Quercini

doi:10.1007/s41019-018-0066-4

Data Science and Engineering (Jun 2018)

A Frequent Named Entities-Based Approach for Interpreting Reputation in Twitter

Nacéra Bennacer Seghouani,
Francesca Bugiotti,
Moditha Hewasinghage,
Suela Isaj,
Gianluca Quercini

Affiliations

Nacéra Bennacer Seghouani: LRI, CentraleSupélec, Paris-Saclay University
Francesca Bugiotti: LRI, CentraleSupélec, Paris-Saclay University
Moditha Hewasinghage: Universitat Politecnica de Catalunya
Suela Isaj: Aalborg University
Gianluca Quercini: LRI, CentraleSupélec, Paris-Saclay University

DOI: https://doi.org/10.1007/s41019-018-0066-4
Journal volume & issue: Vol. 3, no. 2
pp. 86 – 100

Abstract

Read online

Abstract Twitter is a social network that provides a powerful source of data. The analysis of those data offers many challenges among those stands out the opportunity to find reputation of a product, a person or any other entity of interest. Several approaches for sentiment analysis have been proposed in the literature to assess the general opinion expressed in tweets on an entity. Nevertheless, these methods aggregate sentiment scores retrieved from tweets, which is a static view to evaluate the overall reputation of an entity. The reputation of an entity is not static; entities collaborate with each other, and they get involved in different events over time. A simple aggregation of sentiment scores is then not sufficient to represent this dynamism. In this paper, we present a new approach to determine the reputation of an entity on the basis of the set of events in which it is involved. To achieve this, we propose a new sampling method driven by a tweet weighting measure to give a better quality and summary of the target entity. We introduce the concept of Frequent Named Entities to determine the events involving the target entity. Our evaluation achieved for different entities shows that 90% of the reputation of an entity originates from the events it is involved in and the breakdown into events allows interpreting the reputation in a transparent and self-explanatory way.

Published in Data Science and Engineering

ISSN: 2364-1185 (Print); 2364-1541 (Online)
Publisher: SpringerOpen
Country of publisher: Germany
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.springer.com/41019

About the journal

Abstract

Keywords