Applied Sciences (Nov 2021)

A Language Model for Misogyny Detection in Latin American Spanish Driven by Multisource Feature Extraction and Transformers

  • Edwin Aldana-Bobadilla,
  • Alejandro Molina-Villegas,
  • Yuridia Montelongo-Padilla,
  • Ivan Lopez-Arevalo,
  • Oscar S. Sordia

DOI
https://doi.org/10.3390/app112110467
Journal volume & issue
Vol. 11, no. 21
p. 10467

Abstract

Read online

Creating effective mechanisms to detect misogyny online automatically represents significant scientific and technological challenges. The complexity of recognizing misogyny through computer models lies in the fact that it is a subtle type of violence, it is not always explicitly aggressive, and it can even hide behind seemingly flattering words, jokes, parodies, and other expressions. Currently, it is even difficult to have an exact figure for the rate of misogynistic comments online because, unlike other types of violence, such as physical violence, these events are not registered by any statistical systems. This research contributes to the development of models for the automatic detection of misogynistic texts in Latin American Spanish and contributes to the design of data augmentation methodologies since the amount of data required for deep learning models is considerable.

Keywords