Computers (Sep 2024)

Leveraging Large Language Models with Chain-of-Thought and Prompt Engineering for Traffic Crash Severity Analysis and Inference

  • Hao Zhen,
  • Yucheng Shi,
  • Yongcan Huang,
  • Jidong J. Yang,
  • Ninghao Liu

DOI
https://doi.org/10.3390/computers13090232
Journal volume & issue
Vol. 13, no. 9
p. 232

Abstract

Harnessing the power of Large Language Models (LLMs), this study explores the use of three state-of-the-art LLMs, specifically GPT-3.5-turbo, LLaMA3-8B, and LLaMA3-70B, for crash severity analysis and inference, framing it as a classification task. We generated textual narratives from original traffic crash tabular data using a pre-built template infused with domain knowledge. Additionally, we incorporated Chain-of-Thought (CoT) reasoning to guide the LLMs in analyzing crash causes and then inferring severity. This study also examines the impact of prompt engineering specifically designed for crash severity inference. The LLMs were tasked with crash severity inference to: (1) evaluate the models’ capabilities in crash severity analysis, (2) assess the effectiveness of CoT and domain-informed prompt engineering, and (3) examine their reasoning abilities within the CoT framework. Our results showed that LLaMA3-70B consistently outperformed the other models, particularly in zero-shot settings. The CoT and prompt engineering techniques significantly enhanced performance, improving logical reasoning and addressing alignment issues. Notably, CoT offers valuable insights into the LLMs’ reasoning process, unleashing their capacity to consider diverse factors such as environmental conditions, driver behavior, and vehicle characteristics in severity analysis and inference.
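To make the pipeline described in the abstract concrete, the sketch below shows one way a tabular crash record could be rendered into a textual narrative via a template and then wrapped in a zero-shot CoT prompt for severity classification. The field names, template wording, step instructions, and severity labels here are illustrative assumptions, not the paper's actual template or prompts.

```python
# Illustrative sketch only: the field names, template wording, and severity
# labels are assumptions for demonstration; the paper's actual template and
# CoT prompts may differ.

CRASH_TEMPLATE = (
    "A crash occurred on a {road_type} under {weather} conditions at {light}, "
    "involving a {vehicle_type} driven by a {driver_age}-year-old driver "
    "traveling at approximately {speed} mph."
)

COT_PROMPT = (
    "You are a traffic safety expert.\n"
    "Crash narrative: {narrative}\n\n"
    "Step 1: Identify factors that likely contributed to this crash "
    "(environmental conditions, driver behavior, vehicle characteristics).\n"
    "Step 2: Reason about how these factors affect injury outcomes.\n"
    "Step 3: Based on your reasoning, classify the crash severity as one of: "
    "Fatal, Injury, or Property Damage Only.\n"
    "Give your step-by-step reasoning, then the final label."
)


def tabular_to_narrative(record: dict) -> str:
    """Render one tabular crash record into a textual narrative."""
    return CRASH_TEMPLATE.format(**record)


def build_cot_prompt(record: dict) -> str:
    """Wrap the narrative in a zero-shot chain-of-thought classification prompt."""
    return COT_PROMPT.format(narrative=tabular_to_narrative(record))


if __name__ == "__main__":
    example = {
        "road_type": "two-lane rural highway",
        "weather": "rainy",
        "light": "night with no street lighting",
        "vehicle_type": "passenger sedan",
        "driver_age": 24,
        "speed": 65,
    }
    print(build_cot_prompt(example))
```

The resulting prompt string would then be sent to any of the three LLMs; the staged instructions are what elicit the intermediate reasoning that the abstract credits with improving severity inference.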

Keywords