PLoS ONE (Jan 2013)

Early prediction of movie box office success based on Wikipedia activity big data.

  • Márton Mestyán,
  • Taha Yasseri,
  • János Kertész

DOI
https://doi.org/10.1371/journal.pone.0071226
Journal volume & issue
Vol. 8, no. 8
p. e71226

Abstract

Read online

Use of socially generated "big data" to access information about collective states of the minds in human societies has become a new paradigm in the emerging field of computational social science. A natural application of this would be the prediction of the society's reaction to a new product in the sense of popularity and adoption rate. However, bridging the gap between "real time monitoring" and "early predicting" remains a big challenge. Here we report on an endeavor to build a minimalistic predictive model for the financial success of movies based on collective activity data of online users. We show that the popularity of a movie can be predicted much before its release by measuring and analyzing the activity level of editors and viewers of the corresponding entry to the movie in Wikipedia, the well-known online encyclopedia.