Mathematical Biosciences and Engineering (Jul 2023)

CDBC: A novel data enhancement method based on improved between-class learning for darknet detection

  • Binjie Song,
  • Yufei Chang ,
  • Minxi Liao,
  • Yuanhang Wang ,
  • Jixiang Chen,
  • Nianwang Wang

DOI
https://doi.org/10.3934/mbe.2023670
Journal volume & issue
Vol. 20, no. 8
pp. 14959 – 14977

Abstract

Read online

With the development of the Internet, people have paid more attention to privacy protection, and privacy protection technology is widely used. However, it also breeds the darknet, which has become a tool that criminals can exploit, especially in the fields of economic crime and military intelligence. The darknet detection is becoming increasingly important; however, the darknet traffic is seriously unbalanced. The detection is difficult and the accuracy of the detection methods needs to be improved. To overcome these problems, we first propose a novel learning method. The method is the Chebyshev distance based Between-class learning (CDBC), which can learn the spatial distribution of the darknet dataset, and generate "gap data". The gap data can be adopted to optimize the distribution boundaries of the dataset. Second, a novel darknet traffic detection method is proposed. We test the proposed method on the ISCXTor 2016 dataset and the CIC-Darknet 2020 dataset, and the results show that CDBC can help more than 10 existing methods improve accuracy, even up to 99.99%. Compared with other sampling methods, CDBC can also help the classifiers achieve higher recall.

Keywords