Social Botomics: A Systematic Ensemble ML Approach for Explainable and Multi-Class Bot Detection

Ilias Dimitriadis; Konstantinos Georgiou; Athena Vakali

doi:10.3390/app11219857

Applied Sciences (Oct 2021)

Affiliations

Ilias Dimitriadis: Department of Informatics, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
Konstantinos Georgiou: Department of Informatics, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
Athena Vakali: Department of Informatics, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece

DOI: https://doi.org/10.3390/app11219857
Journal volume & issue: Vol. 11, no. 21, p. 9857

Abstract

Online social network (OSN) platforms are under attack by intruders born and raised within their own ecosystems. These attacks vary in scope, from mild critiques to violent offences targeting individual or community rights and opinions. Negative publicity on microblogging platforms such as Twitter is due to the infamous Twitter bots, which strongly affect the circulation and virality of posts. A wide and ongoing research effort has been devoted to developing appropriate countermeasures against emerging “armies of bots”. However, the battle against bots is still intense and, unfortunately, seems to be tilting in the bots’ favor. Since knowing your enemy is critical to winning any war, this work aims to demystify, reveal, and expand the known inherent characteristics of Twitter bots, so that multiple types of bots are recognized and spotted early. More specifically, in this work we: (i) extensively analyze the importance and the type of data and features used to generate ML models for bot classification, (ii) address the open problem of multi-class bot detection, identifying new types of bots, and share two new datasets towards this objective, (iii) provide new individual ML models for binary and multi-class bot classification, and (iv) utilize explainable methods and provide comprehensive visualizations to clearly demonstrate interpretable results. Finally, we utilize all of the above in an effort to improve the so-called Bot-Detective online service. Our experiments demonstrate high accuracy, explainability, and scalability, comparable with the state of the art, despite the challenges of multi-class classification.
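
To illustrate how several individual models might be combined into one multi-class decision that is still easy to explain, the sketch below uses plain soft voting: it averages each model's class probabilities and reports how much each model contributed to the winning class. This is a generic technique chosen for illustration, not the pipeline described in the paper. The class labels, model names, and probability values are all hypothetical, since the abstract does not specify them.

```python
from typing import Dict, List

# Hypothetical class labels; the abstract does not enumerate the bot types.
CLASSES = ["human", "bot_type_a", "bot_type_b"]


def soft_vote(model_probs: Dict[str, List[float]]) -> Dict[str, object]:
    """Average per-model class probabilities (soft voting).

    Returns the winning label, the averaged probability vector, and each
    model's additive share of the winning class probability, which serves
    as a very simple explanation of the ensemble decision.
    """
    if not model_probs:
        raise ValueError("at least one model is required")
    n_models = len(model_probs)
    n_classes = len(CLASSES)
    for name, probs in model_probs.items():
        if len(probs) != n_classes:
            raise ValueError(f"{name} must output {n_classes} probabilities")

    # Unweighted mean of the probability vectors, one entry per class.
    averaged = [
        sum(probs[i] for probs in model_probs.values()) / n_models
        for i in range(n_classes)
    ]
    winner = max(range(n_classes), key=lambda i: averaged[i])

    # The shares sum to averaged[winner] by construction.
    contributions = {
        name: probs[winner] / n_models for name, probs in model_probs.items()
    }
    return {
        "label": CLASSES[winner],
        "probabilities": averaged,
        "contributions": contributions,
    }


# Made-up outputs of three hypothetical individual models for one account.
example = {
    "profile_model": [0.2, 0.7, 0.1],
    "content_model": [0.1, 0.6, 0.3],
    "temporal_model": [0.5, 0.3, 0.2],
}
result = soft_vote(example)
print(result["label"], result["contributions"])
```

Because the contributions are additive, a reader can see at a glance which individual model pushed the ensemble towards a given class. Richer explanation methods would be needed to attribute the decision to individual input features.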

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
