Research on Science and Technology Policy and Regulation Q&amp;A System Driven by Large Models

XIANG Xiaowei, SHEN Yanguang, HU Minghao, YAN Tianwei, LUO Wei, LUO Zhunchen

doi:10.3778/j.issn.1673-9418.2406023

Jisuanji kexue yu tansuo (Sep 2024)

Research on Science and Technology Policy and Regulation Q&A System Driven by Large Models

XIANG Xiaowei, SHEN Yanguang, HU Minghao, YAN Tianwei, LUO Wei, LUO Zhunchen

Affiliations

XIANG Xiaowei, SHEN Yanguang, HU Minghao, YAN Tianwei, LUO Wei, LUO Zhunchen: 1. School of Information and Electrical Engineering, Hebei University of Engineering, Handan, Hebei 056038, China 2. Military Science Information Research Center, Academy of Military Sciences, Beijing 100142, China 3. College of Computer, National University of Defense Technology, Changsha 410037, China

DOI: https://doi.org/10.3778/j.issn.1673-9418.2406023
Journal volume & issue: Vol. 18, no. 9
pp. 2349 – 2360

Abstract

Read online

A question-and-answer (Q&A) system for science and technology (S&T) policies and regulations plays a critical role in helping the public understand and apply these regulations. Large language models (LLM) can significantly enhance the accuracy and efficiency of such systems. However, current LLM-based S&T policy and regulation Q&A systems face several challenges: the lack of large-scale, high-quality datasets, insufficient methods for auto-matically constructing datasets with accurate policy and regulation knowledge integration, and issues with the professional accuracy and timeliness of the models’ knowledge updates. To address these challenges, this paper proposes a retrieval-augmented self-prompting method for constructing a high-quality, large-scale S&T policy and regulation Q&A dataset. Additionally, a Q&A system is developed, which combines an LLM optimized by low-rank adaptation (LoRA) techniques with an S&T policy and regulation knowledge base, and employs prompt learning techniques to guide the system in generating accurate answers. Experimental results demonstrate that the constructed Q&A dataset significantly improves the integration of policy and regulation knowledge compared with traditional methods. Furthermore, the proposed Q&A system outperforms general LLM-driven systems across various metrics, highlighting its enhanced performance in the domain of S&T policies and regulations.

large language model; question-and-answer dataset; low-rank adaptive fine-tuning; prompt learning; science and technology policy and regulation; question-and-answer system

Published in Jisuanji kexue yu tansuo

ISSN: 1673-9418 (Print)
Publisher: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://fcst.ceaj.org

About the journal

Abstract

Keywords