PLoS ONE (Jan 2021)

A dataset for the study of identity at scale: Annual Prevalence of American Twitter Users with specified Token in their Profile Bio 2015-2020.

  • Jason Jeffrey Jones

DOI
https://doi.org/10.1371/journal.pone.0260185
Journal volume & issue
Vol. 16, no. 11
p. e0260185

Abstract

Personally expressed identity is who or what individuals themselves say they are, and it should be studied at scale. "At scale" means with data on millions of individuals; such data are newly available and come timestamped and geocoded. This work introduces a dataset for the study of identity at scale and describes the method for collecting and aggregating such data. It also presents tools and theory for working with the data. A demonstration analysis provides evidence that personal, individual development and changing cultural norms can be observed with these data and methods.