AutoML-ID: automated machine learning model for intrusion detection using wireless sensor network

Abhilash Singh; J. Amutha; Jaiprakash Nagar; Sandeep Sharma; Cheng-Chi Lee

doi:10.1038/s41598-022-13061-z

Scientific Reports (May 2022)

AutoML-ID: automated machine learning model for intrusion detection using wireless sensor network

Abhilash Singh,
J. Amutha,
Jaiprakash Nagar,
Sandeep Sharma,
Cheng-Chi Lee

Affiliations

Abhilash Singh: Indian Institute of Science Education and Research Bhopal, Fluvial Geomorphology and Remote Sensing Laboratory
J. Amutha: Gautam Buddha University, School of ICT
Jaiprakash Nagar: Indian Institute of Technology Kharagpur, Subir Chowdhury School of Quality and Reliability
Sandeep Sharma: Department of Electronics Engineering, Madhav Institute of Technology and Science
Cheng-Chi Lee: Department of Library and Information Science, Research and Development, Center for Physical Education, Health, and Information Technology, Fu Jen Catholic University

DOI: https://doi.org/10.1038/s41598-022-13061-z
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Momentous increase in the popularity of explainable machine learning models coupled with the dramatic increase in the use of synthetic data facilitates us to develop a cost-efficient machine learning model for fast intrusion detection and prevention at frontier areas using Wireless Sensor Networks (WSNs). The performance of any explainable machine learning model is driven by its hyperparameters. Several approaches have been developed and implemented successfully for optimising or tuning these hyperparameters for skillful predictions. However, the major drawback of these techniques, including the manual selection of the optimal hyperparameters, is that they depend highly on the problem and demand application-specific expertise. In this paper, we introduced Automated Machine Learning (AutoML) model to automatically select the machine learning model (among support vector regression, Gaussian process regression, binary decision tree, bagging ensemble learning, boosting ensemble learning, kernel regression, and linear regression model) and to automate the hyperparameters optimisation for accurate prediction of numbers of k-barriers for fast intrusion detection and prevention using Bayesian optimisation. To do so, we extracted four synthetic predictors, namely, area of the region, sensing range of the sensor, transmission range of the sensor, and the number of sensors using Monte Carlo simulation. We used 80% of the datasets to train the models and the remaining 20% for testing the performance of the trained model. We found that the Gaussian process regression performs prodigiously and outperforms all the other considered explainable machine learning models with correlation coefficient (R = 1), root mean square error (RMSE = 0.007), and bias = − 0.006. Further, we also tested the AutoML performance on a publicly available intrusion dataset, and we observed a similar performance. This study will help the researchers accurately predict the required number of k-barriers for fast intrusion detection and prevention.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal