Sensors (Feb 2022)

A Novel Framework for Generating Personalized Network Datasets for NIDS Based on Traffic Aggregation

  • Pablo Velarde-Alvarado,
  • Hugo Gonzalez,
  • Rafael Martínez-Peláez,
  • Luis J. Mena,
  • Alberto Ochoa-Brust,
  • Efraín Moreno-García,
  • Vanessa G. Félix,
  • Rodolfo Ostos

DOI
https://doi.org/10.3390/s22051847
Journal volume & issue
Vol. 22, no. 5
p. 1847

Abstract

Read online

In this paper, we addressed the problem of dataset scarcity for the task of network intrusion detection. Our main contribution was to develop a framework that provides a complete process for generating network traffic datasets based on the aggregation of real network traces. In addition, we proposed a set of tools for attribute extraction and labeling of traffic sessions. A new dataset with botnet network traffic was generated by the framework to assess our proposed method with machine learning algorithms suitable for unbalanced data. The performance of the classifiers was evaluated in terms of macro-averages of F1-score (0.97) and the Matthews Correlation Coefficient (0.94), showing a good overall performance average.

Keywords