A Multi-Agent Intrusion Detection System Optimized by a Deep Reinforcement Learning Approach with a Dataset Enlarged Using a Generative Model to Reduce the Bias Effect

Matthieu Mouyart; Guilherme Medeiros Machado; Jae-Yun Jun

doi:10.3390/jsan12050068

Journal of Sensor and Actuator Networks (Sep 2023)

A Multi-Agent Intrusion Detection System Optimized by a Deep Reinforcement Learning Approach with a Dataset Enlarged Using a Generative Model to Reduce the Bias Effect

Matthieu Mouyart,
Guilherme Medeiros Machado,
Jae-Yun Jun

Affiliations

Matthieu Mouyart: LyRIDS, ECE Paris, 10 rue Sextius Michel, 75015 Paris, France
Guilherme Medeiros Machado: LyRIDS, ECE Paris, 10 rue Sextius Michel, 75015 Paris, France
Jae-Yun Jun: LyRIDS, ECE Paris, 10 rue Sextius Michel, 75015 Paris, France

DOI: https://doi.org/10.3390/jsan12050068
Journal volume & issue: Vol. 12, no. 5
p. 68

Abstract

Read online

Intrusion detection systems can defectively perform when they are adjusted with datasets that are unbalanced in terms of attack data and non-attack data. Most datasets contain more non-attack data than attack data, and this circumstance can introduce biases in intrusion detection systems, making them vulnerable to cyberattacks. As an approach to remedy this issue, we considered the Conditional Tabular Generative Adversarial Network (CTGAN), with its hyperparameters optimized using the tree-structured Parzen estimator (TPE), to balance an insider threat tabular dataset called the CMU-CERT, which is formed by discrete-value and continuous-value columns. We showed through this method that the mean absolute errors between the probability mass functions (PMFs) of the actual data and the PMFs of the data generated using the CTGAN can be relatively small. Then, from the optimized CTGAN, we generated synthetic insider threat data and combined them with the actual ones to balance the original dataset. We used the resulting dataset for an intrusion detection system implemented with the Adversarial Environment Reinforcement Learning (AE-RL) algorithm in a multi-agent framework formed by an attacker and a defender. We showed that the performance of detecting intrusions using the framework of the CTGAN and the AE-RL is significantly improved with respect to the case where the dataset is not balanced, giving an F1-score of 0.7617.

Published in Journal of Sensor and Actuator Networks

ISSN: 2224-2708 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology
Website: https://www.mdpi.com/journal/jsan

About the journal

Abstract

Keywords