Dataset of controversial news posts in Spanish from the reader's perspective

Cesar Macias; Hiram Calvo; Omar Juárez Gambino

Data in Brief (Apr 2024)

Dataset of controversial news posts in Spanish from the reader's perspective

Cesar Macias,
Hiram Calvo,
Omar Juárez Gambino

Affiliations

Cesar Macias: Centro de Investigación en Computación, Instituto Politécnico Nacional. Av. Juan de Dios Bátiz, esq. Miguel Othón de Mendizabal, Col. Nueva Industrial Vallejo, Alcaldía Gustavo A. Madero, C.P. 07700, CDMX, México
Hiram Calvo: Centro de Investigación en Computación, Instituto Politécnico Nacional. Av. Juan de Dios Bátiz, esq. Miguel Othón de Mendizabal, Col. Nueva Industrial Vallejo, Alcaldía Gustavo A. Madero, C.P. 07700, CDMX, México
Omar Juárez Gambino: Escuela Superior de Cómputo, Instituto Politécnico Nacional. Av. Luis Enrique Erro S/N, Unidad Profesional Adolfo López Mateos, Zacatenco. Alcaldía Gustavo A. Madero C.P. 07738, CDMX, México

Journal volume & issue: Vol. 53
p. 110220

Abstract

Read online

This paper presents a corpus of Spanish news posts obtained from X with the annotation of controversy made via crowdsourcing. A total of 60 tweets were obtained from 8 different newspapers. For the annotation task, a survey was developed and sent to 31 different participants to answer it with the controversy level they perceived from the news post summary and headline presented on the post. The most frequent selected option was assigned as the initial controversy level of the post. The final annotation of the corpus was made via an analysis of the raw data by computing the Inter Annotator Agreement (IAA). The analysis showed that the binarization of the data was the most convenient way to annotate it. A potential use for this dataset is detailed in further sections.

Published in Data in Brief

ISSN: 2352-3409 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Science (General)
Website: http://www.journals.elsevier.com/data-in-brief/

About the journal

Abstract

Keywords