CNN Based Malicious Website Detection by Invalidating Multiple Web Spams

Dongjie Liu; Jong-Hyouk Lee

doi:10.1109/ACCESS.2020.2995157

IEEE Access (Jan 2020)

Affiliations

Dongjie Liu: Computer Network Information Center, Chinese Academy of Sciences, Beijing, China
Jong-Hyouk Lee: Department of Computer and Information Security, Sejong University, Seoul, South Korea

DOI: https://doi.org/10.1109/ACCESS.2020.2995157
Journal volume & issue: Vol. 8
pp. 97258 – 97266

Abstract

Although a variety of techniques for detecting malicious websites have been proposed, it has become increasingly difficult for these methods to deliver satisfactory results. Many malicious websites still escape detection by using various Web spam techniques. In this paper, we first summarize three types of Web spam techniques used by malicious websites: redirection spam, hidden IFrame spam, and content hiding spam. We then present a new detection method that adopts the perspective of users and takes screenshots of malicious webpages to invalidate Web spam. The proposed detection method uses a Convolutional Neural Network (CNN), a class of deep neural networks, as its classification algorithm. To verify the effectiveness of the method, two experiments were conducted. First, the proposed method was tested on a constructed complex dataset, and we present comparison results between the proposed method and representative machine learning-based detection algorithms. Second, the proposed method was used to detect malicious websites in a real-world Web environment for three months. The experimental results show that the proposed method performs better and is applicable to a practical Web environment.
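
To make the screenshot-plus-CNN idea concrete, the following is a minimal sketch of a single convolutional forward pass over a tiny grayscale "screenshot", written in plain Python. It is not the authors' architecture: the function names, the 4x4 input, the two hand-picked kernels, and the output weights are all hypothetical and untrained, chosen only to show how pixel data rendered for the user, rather than page source, could feed a classifier.

```python
import math

def conv2d_valid(image, kernel):
    """Valid (no padding) 2D cross-correlation of a grayscale image with one kernel."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh) for dj in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]

def relu(fmap):
    """Element-wise rectified linear unit."""
    return [[max(0.0, v) for v in row] for row in fmap]

def global_max_pool(fmap):
    """Collapse a feature map to its single largest activation."""
    return max(max(row) for row in fmap)

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def malicious_score(screenshot, kernels, weights, bias):
    """Forward pass: conv -> ReLU -> global max pool -> logistic output."""
    features = [global_max_pool(relu(conv2d_valid(screenshot, k))) for k in kernels]
    logit = sum(w * f for w, f in zip(weights, features)) + bias
    return sigmoid(logit)

# Hypothetical 4x4 grayscale "screenshot" with pixel values in [0, 1].
screenshot = [
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
]

# Untrained, hand-picked parameters for illustration only (assumptions).
kernels = [
    [[-1.0, 1.0], [-1.0, 1.0]],  # responds to dark-to-bright vertical edges
    [[1.0, 1.0], [-1.0, -1.0]],  # responds to bright-to-dark horizontal edges
]
weights = [1.0, -1.0]
bias = -1.0

score = malicious_score(screenshot, kernels, weights, bias)
print(f"score = {score:.4f}")
```

In a real system the parameters would be learned from labeled screenshots and the network would have many more layers; the sketch only illustrates the data flow from rendered pixels to a score between 0 and 1.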

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
