Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys

Zachary Joseph Roman; Holger Brandt; Jason Michael Miller

doi:10.3389/fpsyg.2022.789223

Frontiers in Psychology (Apr 2022)

Affiliations

Zachary Joseph Roman: Department of Psychology, University of Zurich, Zürich, Switzerland
Holger Brandt: Department of Psychology, Faculty of Mathematics and Natural Sciences, University of Tübingen, Tübingen, Germany
Jason Michael Miller: Department of Psychology, University of Kansas, Lawrence, KS, United States

Journal volume & issue: Vol. 13

Abstract

Behavioral scientists have become increasingly reliant on online survey platforms such as Amazon's Mechanical Turk (MTurk). These platforms have many advantages: for example, they provide ease of access to difficult-to-sample populations, a large pool of participants, and an easy-to-use implementation. A major drawback is the existence of bots that are used to complete online surveys for financial gain. These bots contaminate data and need to be identified in order to draw valid conclusions from data obtained with these platforms. In this article, we provide a Bayesian latent class joint modeling approach that can be routinely applied to identify bots and simultaneously estimate a model of interest. This method can be used to separate the bots' response patterns from real human responses that were provided in line with the item content. The model has the advantage that it is very flexible and is based on plausible assumptions that are met in most empirical settings. We provide a simulation study that investigates the performance of the model under several relevant scenarios that vary sample size, proportion of bots, and model complexity. We show that ignoring bots leads to severe parameter bias, whereas the Bayesian latent class model results in unbiased estimates and thus controls this source of bias. We illustrate the model and its capabilities with data from an empirical political ideation survey with known bots. We discuss the implications of the findings with regard to future data collection via online platforms.

Published in Frontiers in Psychology

ISSN: 1664-1078 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Philosophy. Psychology. Religion: Psychology
Website: https://www.frontiersin.org/journals/psychology
