IEEE Access (Jan 2022)

System Failure Detection Using Deep Learning Models Integrating Timestamps With Nonuniform Intervals

  • Yixin Huangfu,
  • Saeid Habibi,
  • Alan Wassyng

DOI
https://doi.org/10.1109/ACCESS.2022.3150342
Journal volume & issue
Vol. 10
pp. 17629 – 17640

Abstract

Read online

System logs play an important role in software development and system maintenance. Many system software programs continuously generate system logs during software runtimes for failure detection and diagnosis purposes. Currently, the analysis of system log data is mainly a manual process that highly depends on human knowledge and experience. This time-consuming task has become a problem because of the ever-increasing volume of log data. Existing studies have investigated machine learning and deep learning techniques to automate the failure detection task. This paper takes the deep learning approach and proposes two detection structures based on recurrent and convolutional neural networks. More importantly, this paper takes a step further by closely examining the timestamps of log data which existing studies have generally ignored. This study found that time information can be a distinguishing factor between regular and abnormal log sequences. Inspired by this observation, a novel method is proposed to integrate log timestamps in deep learning models using interpolation techniques. The evaluation results show that the log timestamps can significantly improve the performance of failure detection. Cross-comparison of the different models demonstrates that the proposed network structure can successfully utilize the timestamp information. The code is available on GitHub: https://github.com/hfyxin/Ts-models-log-data-analysis.git.

Keywords