LSTM RNN: detecting exploit kits using redirection chain sequences

Jonah Burgess; Philip O’Kane; Sakir Sezer; Domhnall Carlin

doi:10.1186/s42400-021-00093-7

Cybersecurity (Jul 2021)

LSTM RNN: detecting exploit kits using redirection chain sequences

Jonah Burgess,
Philip O’Kane,
Sakir Sezer,
Domhnall Carlin

Affiliations

Jonah Burgess: Centre for Secure Information Technologies (CSIT), Queen’s University Belfast, Northern Ireland Science Park
Philip O’Kane: Centre for Secure Information Technologies (CSIT), Queen’s University Belfast, Northern Ireland Science Park
Sakir Sezer: Centre for Secure Information Technologies (CSIT), Queen’s University Belfast, Northern Ireland Science Park
Domhnall Carlin: Centre for Secure Information Technologies (CSIT), Queen’s University Belfast, Northern Ireland Science Park

DOI: https://doi.org/10.1186/s42400-021-00093-7
Journal volume & issue: Vol. 4, no. 1
pp. 1 – 15

Abstract

Read online

Abstract While consumers use the web to perform routine activities, they are under the constant threat of attack from malicious websites. Even when visiting ‘trusted’ sites, there is always a risk that site is compromised, and, hosting a malicious script. In this scenario, the injected script would typically force the victim’s browser to undergo a series of redirects before reaching an attacker-controlled domain, which, delivers the actual malware. Although these malicious redirection chains aim to frustrate detection and analysis efforts, they could be used to help identify web-based attacks. Building upon previous work, this paper presents the first known application of a Long Short-Term Memory (LSTM) network to detect Exploit Kit (EK) traffic, utilising the structure of HTTP redirects. Samples are processed as sequences, where each timestep represents a redirect and contains a unique combination of 48 features. The experiment is conducted using a ground-truth dataset of 1279 EK and 5910 benign redirection chains. Hyper-parameters are tuned via K-fold cross-validation (5f-CV), with the optimal configuration achieving an F1 score of 0.9878 against the unseen test set. Furthermore, we compare the results of isolated feature categories to assess their importance.

Published in Cybersecurity

ISSN: 2523-3246 (Online)
Publisher: SpringerOpen
Country of publisher: Singapore
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://cybersecurity.springeropen.com/

About the journal

Abstract

Keywords