ASNM Datasets: A Collection of Network Attacks for Testing of Adversarial Classifiers and Intrusion Detectors

Ivan Homoliak; Kamil Malinka; Petr Hanacek

doi:10.1109/ACCESS.2020.3001768

IEEE Access (Jan 2020)

ASNM Datasets: A Collection of Network Attacks for Testing of Adversarial Classifiers and Intrusion Detectors

Ivan Homoliak,
Kamil Malinka,
Petr Hanacek

Affiliations

Ivan Homoliak: ORCiD; Centre of Excellence IT4Innovations, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
Kamil Malinka: ORCiD; Centre of Excellence IT4Innovations, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
Petr Hanacek: ORCiD; Centre of Excellence IT4Innovations, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic

DOI: https://doi.org/10.1109/ACCESS.2020.3001768
Journal volume & issue: Vol. 8
pp. 112427 – 112453

Abstract

Read online

In this paper, we present three datasets that have been built from network traffic traces using ASNM (Advanced Security Network Metrics) features, designed in our previous work. The first dataset was built using a state-of-the-art dataset CDX 2009 that was collected during a cyber defense exercise, while the remaining two datasets were collected by us in 2015 and 2018 using publicly available network services containing buffer overflow and other high severity vulnerabilities. These two datasets contain several adversarial obfuscation techniques that were applied onto malicious as well as legitimate traffic samples during “the execution” of their TCP network connections. Adversarial obfuscation techniques were used for evading machine learning-based network intrusion detection classifiers. We show that the performance of such classifiers can be improved when partially augmenting their training data by samples obtained from obfuscation techniques. In detail, we utilized tunneling obfuscation in HTTP(S) protocol and non-payload-based obfuscations modifying various properties of network traffic by, e.g., TCP segmentation, re-transmissions, corrupting and reordering of packets, etc. To the best of our knowledge, this is the first collection of network traffic data that contains adversarial techniques and is intended for non-payload-based network intrusion detection and adversarial classification. Provided datasets enable testing of the evasion resistance of arbitrary machine learning-based classifiers.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords