Research Ethics Review (Jul 2024)

Passive data collection on Reddit: a practical approach

  • Tiago Rocha-Silva,
  • Conceição Nogueira,
  • Liliana Rodrigues

DOI
https://doi.org/10.1177/17470161231210542
Journal volume & issue
Vol. 20

Abstract

Read online

Since its onset, scholars have characterized social media as a valuable source for data collection since it presents several benefits (e.g. exploring research questions with hard-to-reach populations). Nonetheless, methods of online data collection are riddled with ethical and methodological challenges that researchers must consider if they want to adopt good practices when collecting and analyzing online data. Drawing from our primary research project, where we collected passive online data on Reddit, we explore and detail the steps that researchers must consider before collecting online data: (1) planning online data collection; (2) ethical considerations; and (3) data collection. We also discuss two atypical questions that researchers should also consider: (1) how to handle deleted user-generated content; and (2) how to quote user-generated content. Moving on from the dichotomous discussion between what is public and private data, we present recommendations for good practices when collecting and analyzing qualitative online data.