Leveraging Large Language Models and BERT for Log Parsing and Anomaly Detection

Yihan Zhou; Yan Chen; Xuanming Rao; Yukang Zhou; Yuxin Li; Chao Hu

doi:10.3390/math12172758

Mathematics (Sep 2024)

Leveraging Large Language Models and BERT for Log Parsing and Anomaly Detection

Yihan Zhou,
Yan Chen,
Xuanming Rao,
Yukang Zhou,
Yuxin Li,
Chao Hu

Affiliations

Yihan Zhou: School of Computer Science and Engineering, Central South University, Changsha 410083, China
Yan Chen: Logistics Department, Central South University, Changsha 410083, China
Xuanming Rao: School of Computer Science and Engineering, Central South University, Changsha 410083, China
Yukang Zhou: Department of Electrical and Information Engineering, Hong Kong Polytechnic University, Hong Kong, China
Yuxin Li: School of Computer Science and Engineering, Nanyang Technological University, Singapore 639798, Singapore
Chao Hu: School of Electronic Information, Central South University, Changsha 410083, China

DOI: https://doi.org/10.3390/math12172758
Journal volume & issue: Vol. 12, no. 17
p. 2758

Abstract

Read online

Computer systems and applications generate large amounts of logs to measure and record information, which is vital to protect the systems from malicious attacks and useful for repairing faults, especially with the rapid development of distributed computing. Among various logs, the anomaly log is beneficial for operations and maintenance (O&M) personnel to locate faults and improve efficiency. In this paper, we utilize a large language model, ChatGPT, for the log parser task. We choose the BERT model, a self-supervised framework for log anomaly detection. BERT, an embedded transformer encoder, with a self-attention mechanism can better handle context-dependent tasks such as anomaly log detection. Meanwhile, it is based on the masked language model task and next sentence prediction task in the pretraining period to capture the normal log sequence pattern. The experimental results on two log datasets show that the BERT model combined with an LLM performed better than other classical models such as Deelog and Loganomaly.

Published in Mathematics

ISSN: 2227-7390 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/mathematics

About the journal

Abstract

Keywords