Security Hardening of Botnet Detectors Using Generative Adversarial Networks

Rizwan Hamid Randhawa; Nauman Aslam; Mohammad Alauthman; Husnain Rafiq; Frank Comeau

doi:10.1109/ACCESS.2021.3083421

IEEE Access (Jan 2021)

Security Hardening of Botnet Detectors Using Generative Adversarial Networks

Rizwan Hamid Randhawa,
Nauman Aslam,
Mohammad Alauthman,
Husnain Rafiq,
Frank Comeau

Affiliations

Rizwan Hamid Randhawa: ORCiD; Department of Computer and Information Systems, Northumbria University, Newcastle upon Tyne, U.K
Nauman Aslam: ORCiD; Department of Computer and Information Systems, Northumbria University, Newcastle upon Tyne, U.K
Mohammad Alauthman: ORCiD; Department of Information Security, University of Petra, Amman, Jordan
Husnain Rafiq: Department of Computer and Information Systems, Northumbria University, Newcastle upon Tyne, U.K
Frank Comeau: Department of Engineering, St. Francis Xavier University, Antigonish, NS, Canada

DOI: https://doi.org/10.1109/ACCESS.2021.3083421
Journal volume & issue: Vol. 9
pp. 78276 – 78292

Abstract

Read online

Machine learning (ML) based botnet detectors are no exception to traditional ML models when it comes to adversarial evasion attacks. The datasets used to train these models have also scarcity and imbalance issues. We propose a new technique named Botshot, based on generative adversarial networks (GANs) for addressing these issues and proactively making botnet detectors aware of adversarial evasions. Botshot is cost-effective as compared to the network emulation for botnet traffic data generation rendering the dedicated hardware resources unnecessary. First, we use the extended set of network flow and time-based features for three publicly available botnet datasets. Second, we utilize two GANs (vanilla, conditional) for generating realistic botnet traffic. We evaluate the generator performance using classifier two-sample test (C2ST) with 10-fold 70-30 train-test split and propose the use of ’recall’ in contrast to ’accuracy’ for proactively learning adversarial evasions. We then augment the train set with the generated data and test using the unchanged test set. Last, we compare our results with benchmark oversampling methods with augmentation of additional botnet traffic data in terms of average accuracy, precision, recall and F1 score over six different ML classifiers. The empirical results demonstrate the effectiveness of the GAN-based oversampling for learning in advance the adversarial evasion attacks on botnet detectors.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords