Scientific Reports (Oct 2024)

Sentiment classification for insider threat identification using metaheuristic optimized machine learning classifiers

  • Djordje Mladenovic,
  • Milos Antonijevic,
  • Luka Jovanovic,
  • Vladimir Simic,
  • Miodrag Zivkovic,
  • Nebojsa Bacanin,
  • Tamara Zivkovic,
  • Jasmina Perisic

DOI
https://doi.org/10.1038/s41598-024-77240-w
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 39

Abstract

Read online

Abstract This study examines the formidable and complex challenge of insider threats to organizational security, addressing risks such as ransomware incidents, data breaches, and extortion attempts. The research involves six experiments utilizing email, HTTP, and file content data. To combat insider threats, emerging Natural Language Processing techniques are employed in conjunction with powerful Machine Learning classifiers, specifically XGBoost and AdaBoost. The focus is on recognizing the sentiment and context of malicious actions, which are considered less prone to change compared to commonly tracked metrics like location and time of access. To enhance detection, a term frequency-inverse document frequency-based approach is introduced, providing a more robust, adaptable, and maintainable method. Moreover, the study acknowledges the significant impact of hyperparameter selection on classifier performance and employs various contemporary optimizers, including a modified version of the red fox optimization algorithm. The proposed approach undergoes testing in three simulated scenarios using a public dataset, showcasing commendable outcomes.

Keywords