Insider Threat Identification Using the Simultaneous Neural Learning of Multi-Source Logs

Liu Liu; Chao Chen; Jun Zhang; Olivier De Vel; Yang Xiang

doi:10.1109/ACCESS.2019.2957055

IEEE Access (Jan 2019)

Insider Threat Identification Using the Simultaneous Neural Learning of Multi-Source Logs

Liu Liu,
Chao Chen,
Jun Zhang,
Olivier De Vel,
Yang Xiang

Affiliations

Liu Liu: ORCiD; School of Software and Electrical Engineering, Swinburne University of Technology, Hawthorn, VIC, Australia
Chao Chen: ORCiD; School of Software and Electrical Engineering, Swinburne University of Technology, Hawthorn, VIC, Australia
Jun Zhang: ORCiD; School of Software and Electrical Engineering, Swinburne University of Technology, Hawthorn, VIC, Australia
Olivier De Vel: ORCiD; Department of Defence, Defence Science and Technology Group, Edinburgh, SA, Australia
Yang Xiang: ORCiD; School of Software and Electrical Engineering, Swinburne University of Technology, Hawthorn, VIC, Australia

DOI: https://doi.org/10.1109/ACCESS.2019.2957055
Journal volume & issue: Vol. 7
pp. 183162 – 183176

Abstract

Read online

Insider threat detection has drawn increasing attention in recent years. In order to capture a malicious insider's digital footprints that occur scatteredly across a wide range of audit data sources over a long period of time, existing approaches often leverage a scoring mechanism to orchestrate alerts generated from multiple sub-detectors, or require domain knowledge-based feature engineering to conduct a one-off analysis across multiple types of data. These approaches result in a high deployment complexity and incur additional costs for engaging security experts. In this paper, we present a novel approach that works with a variety of security logs. The security logs are transformed into texts in the same format and then arranged as a corpus. Using the model trained by Word2vec with the corpus, we are enabled to approximate the posterior probabilities for insider behaviours. Accordingly, we label the transformed events as suspicious if their behavioural probabilities are smaller than a given threshold, and a user is labelled as malicious if he/she is associated with multiple suspicious events. The experiments are undertaken with the Carnegie Mellon University (CMU) CERT Programs insider threat database v6.2, which not only demonstrate that the proposed approach is effective and scalable in practical applications but also provide a guidance for tuning the parameters and thresholds.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords