Data in Brief (Oct 2019)

Data set for automatic detection of online misogynistic speech

  • Theo Lynn,
  • Patricia Takako Endo,
  • Pierangelo Rosati,
  • Ivanovitch Silva,
  • Guto Leoni Santos,
  • Debbie Ging

Journal volume & issue
Vol. 26

Abstract

The data set comprises 2285 definitions posted on the Urban Dictionary platform from 1999 to May 2016. Each definition was classified as misogynistic or non-misogynistic by three independent researchers with domain knowledge. The data set is available in a public repository as a table with two columns: the text-based definition from Urban Dictionary and its classification (1 for misogynistic, 0 for non-misogynistic).

Keywords: Misogyny detection, Misogynistic speech, Hate speech, Online speech, Urban dictionary
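
The two-column layout described in the abstract could be read roughly as follows. This is a minimal sketch, not the authors' tooling: it assumes the table is distributed as CSV with a header row, and the column names `definition` and `is_misogyny` are hypothetical placeholders that may differ from the published file. The in-memory sample uses neutral placeholder text rather than real entries.

```python
import csv
import io
from collections import Counter

# Hypothetical column names; the published file may use different headers.
TEXT_COL = "definition"
LABEL_COL = "is_misogyny"


def load_labeled_definitions(handle):
    """Read (text, label) pairs from a CSV file object with a header row."""
    rows = []
    for record in csv.DictReader(handle):
        label = int(record[LABEL_COL])
        # The abstract describes a binary scheme: 1 = misogynistic, 0 = not.
        if label not in (0, 1):
            raise ValueError(f"unexpected label: {label}")
        rows.append((record[TEXT_COL], label))
    return rows


def class_counts(rows):
    """Count how many definitions carry each label."""
    return Counter(label for _, label in rows)


# Placeholder sample standing in for the real table (not actual data).
SAMPLE = (
    "definition,is_misogyny\n"
    "placeholder text A,1\n"
    "placeholder text B,0\n"
    "placeholder text C,0\n"
)

rows = load_labeled_definitions(io.StringIO(SAMPLE))
counts = class_counts(rows)
print(f"misogynistic: {counts[1]}, non-misogynistic: {counts[0]}")
```

To use it on a downloaded file, the `io.StringIO(SAMPLE)` argument would be replaced by a file object opened with `newline=""` and the appropriate encoding. Checking the class counts first is a reasonable step before training a classifier, since the abstract does not state how the 2285 definitions are split between the two labels.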