An Empirical Study of the Code Generation of Safety-Critical Software Using LLMs

Mingxing Liu; Junfeng Wang; Tao Lin; Quan Ma; Zhiyang Fang; Yanqun Wu

doi:10.3390/app14031046

Applied Sciences (Jan 2024)

An Empirical Study of the Code Generation of Safety-Critical Software Using LLMs

Mingxing Liu,
Junfeng Wang,
Tao Lin,
Quan Ma,
Zhiyang Fang,
Yanqun Wu

Affiliations

Mingxing Liu: College of Computer Science, Sichuan University, Chengdu 610065, China
Junfeng Wang: College of Computer Science, Sichuan University, Chengdu 610065, China
Tao Lin: College of Computer Science, Sichuan University, Chengdu 610065, China
Quan Ma: Science and Technology on Reactor System Design Technology Laboratory, Nuclear Power Institute of China, Chengdu 610041, China
Zhiyang Fang: College of Computer Science, Sichuan University, Chengdu 610065, China
Yanqun Wu: College of Computer Science, Sichuan University, Chengdu 610065, China

DOI: https://doi.org/10.3390/app14031046
Journal volume & issue: Vol. 14, no. 3
p. 1046

Abstract

Read online

In the digital era of increasing software complexity, improving the development efficiency of safety-critical software is a challenging task faced by academia and industry in domains such as nuclear energy, aviation, the automotive industry, and rail transportation. Recently, people have been excited about using pre-trained large language models (LLMs) such as ChatGPT and GPT-4 to generate code. Professionals in the safety-critical software field are intrigued by the code generation capabilities of LLMs. However, there is currently a lack of systematic case studies in this area. Aiming at the need for automated code generation in safety-critical domains such as nuclear energy and the automotive industry, this paper conducts a case study on generating safety-critical software code using GPT-4 as the tool. Practical engineering cases from the industrial domain are employed. We explore different approaches, including code generation based on overall requirements, specific requirements, and augmented prompts. We propose a novel prompt engineering method called Prompt-FDC that integrates basic functional requirements, domain feature generalization, and domain constraints. This method improves code completeness from achieving 30% functions to 100% functions, increases the code comment rate to 26.3%, and yields better results in terms of code compliance, readability, and maintainability. The code generation approach based on LLMs also introduces a new software development process and V-model lifecycle for safety-critical software. Through systematic case studies, we demonstrate that, with appropriate prompt methods, LLMs can auto-generate safety-critical software code that meets practical engineering application requirements. It is foreseeable that LLMs can be applied to various engineering domains to improve software safety and development efficiency.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords