A scientific-article key-insight extraction system based on multi-actor of fine-tuned open-source large language models

Zihan Song; Gyo-Yeob Hwang; Xin Zhang; Shan Huang; Byung-Kwon Park

doi:10.1038/s41598-025-85715-7

Scientific Reports (Jan 2025)

Affiliations

Zihan Song: Dong-A University
Gyo-Yeob Hwang: Dong-A University
Xin Zhang: Dong-A University
Shan Huang: Dong-A University
Byung-Kwon Park: Dong-A University

DOI: https://doi.org/10.1038/s41598-025-85715-7
Journal volume & issue: Vol. 15, no. 1, pp. 1–11

Abstract

The exponential growth of scientific articles has created challenges for information organization and extraction. Automation is urgently needed to streamline literature reviews and enhance insight extraction. We explore the potential of Large Language Models (LLMs), including OpenAI’s GPT-4.0, MistralAI’s Mixtral 8 × 7B, 01AI’s Yi, and InternLM’s InternLM2, for extracting key insights from scientific articles. We have developed an LLM-based, article-level key-insight extraction system, which we call ArticleLLM. After evaluating the LLMs against manual benchmarks, we enhanced their performance through fine-tuning. We propose a multi-actor LLM approach that merges the strengths of multiple fine-tuned LLMs to improve overall key-insight extraction performance. This work demonstrates not only the feasibility of using LLMs for key-insight extraction, but also the effectiveness of cooperation among multiple fine-tuned LLMs, enabling more efficient academic literature surveys and knowledge discovery.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
