Scientific Reports (Mar 2024)

CPMI-ChatGLM: parameter-efficient fine-tuning ChatGLM with Chinese patent medicine instructions

  • Can Liu,
  • Kaijie Sun,
  • Qingqing Zhou,
  • Yuchen Duan,
  • Jianhua Shu,
  • Hongxing Kan,
  • Zongyun Gu,
  • Jili Hu

DOI
https://doi.org/10.1038/s41598-024-56874-w
Journal volume & issue
Vol. 14, no. 1
pp. 1–13

Abstract

Chinese patent medicine (CPM) is a typical type of traditional Chinese medicine (TCM) preparation that uses Chinese herbs as raw materials and is an important means of treating diseases in TCM. Chinese patent medicine instructions (CPMI) serve as a guide for patients to use drugs safely and effectively. In this study, we apply a pre-trained language model to the domain of CPM. We meticulously assembled, processed, and released the first CPMI dataset and fine-tuned the ChatGLM-6B base model, resulting in CPMI-ChatGLM. We performed parameter-efficient fine-tuning on consumer-grade graphics cards and investigated the impact of LoRA and P-Tuning v2, as well as different data scales and instruction-data settings, on model performance. We evaluated CPMI-ChatGLM using the BLEU, ROUGE, and BARTScore metrics; our model achieved scores of 0.7641, 0.8188, 0.7738, 0.8107, and −2.4786 on BLEU-4, ROUGE-1, ROUGE-2, ROUGE-L, and BARTScore, respectively. In comparison experiments and human evaluation against four large language models of similar parameter scale, CPMI-ChatGLM achieved state-of-the-art performance. CPMI-ChatGLM shows commendable proficiency in CPM recommendation, making it a promising tool for auxiliary diagnosis and treatment. Furthermore, the various attributes in the CPMI dataset can be used for data mining and analysis, giving the dataset practical application value and research significance.
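For readers unfamiliar with the reported metrics, ROUGE-L (one of the scores above) is based on the longest common subsequence (LCS) between a generated answer and a reference. The following is a minimal pure-Python sketch of a token-level ROUGE-L F1 score; the whitespace tokenization and the balanced F1 (beta = 1) are simplifying assumptions, not the exact configuration used in the paper.

```python
def lcs_len(a, b):
    # Dynamic-programming longest common subsequence length of two token lists.
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            dp[i][j] = dp[i - 1][j - 1] + 1 if x == y else max(dp[i - 1][j], dp[i][j - 1])
    return dp[len(a)][len(b)]

def rouge_l(candidate: str, reference: str) -> float:
    # Token-level ROUGE-L F1 (beta = 1); whitespace tokenization is an
    # illustrative assumption -- real evaluations use proper tokenizers.
    cand, ref = candidate.split(), reference.split()
    lcs = lcs_len(cand, ref)
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(cand), lcs / len(ref)
    return 2 * precision * recall / (precision + recall)

# Example: 5 of 6 tokens form the LCS, so precision = recall = F1 = 5/6.
score = rouge_l("the cat sat on the mat", "the cat is on the mat")
```

BLEU-n works analogously but counts n-gram precision with a brevity penalty, which is why the paper reports BLEU-4 alongside ROUGE-1/2/L.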