Techne (Nov 2024)
Optimizing Imbalanced Data Classification: Under Sampling Algorithm Strategy with Classification Combination
Abstract
The security of Internet of Things devices is a factor that must be considered because device damage and data theft can occur. Internet of Things devices are very useful in various sectors, such as health, transportation, and industrial sectors. Attacks on Internet of Things devices increase every year. To overcome this, it is necessary to take a research approach with machine learning. The dataset used is CIC IoT Attacks 2023 from the University Of New Brunswick. To be able to produce good data, it is necessary to do random under sampling as a way to overcome data imbalance. Then, modeling is done using the KNN algorithm, Random Forest, Logistic Regression, Adaboost, And Perceptron. The result of this research is that random forest has the best accuracy result of 99.73%. From these results, it can be concluded that the random under-sampling technique can improve the accuracy of data imbalance.
Keywords