IEEE Access (Jan 2021)

Malicious URL Detection Based on a Parallel Neural Joint Model

  • Jianting Yuan,
  • Guanxin Chen,
  • Shengwei Tian,
  • Xinjun Pei

DOI
https://doi.org/10.1109/ACCESS.2021.3049625
Journal volume & issue
Vol. 9
pp. 9464 – 9472

Abstract

Read online

A parallel neural joint model algorithm is proposed for the analysis and detection of malicious Uniform Resource Locator (URL). By detecting and analyzing malicious URL's characteristics, the semantic and visual information will be extracted. First, a visualization algorithm is used to realize the visualization of the URL mapping to a gray image with texture characteristics. Second, the lexical feature and character feature of URL are extracted and further processed through word vector technology. These extracted features are transformed into lexical embedding vectors and character embedding vectors. To combine the texture features with text features, a parallel joint neural network combining capsule network (CapsNet) and independent recurrent neural network (IndRNN) is utilized to capture multi-modal vectors of visual and semantic information synchronously. The last layer utilizes the attention mechanism to further filter the deep features extracted from the overall network while concentrating on effective features improving the classification accuracy and analyzing and detect malicious URLs. Based on the experimental results, it is demonstrated that this algorithm has higher accuracy compared to the traditional algorithms.

Keywords