LOGIC: LLM-originated guidance for internal cognitive improvement of small language models in stance detection

Woojin Lee; Jaewook Lee; Harksoo Kim

doi:10.7717/peerj-cs.2585

PeerJ Computer Science (Dec 2024)

LOGIC: LLM-originated guidance for internal cognitive improvement of small language models in stance detection

Woojin Lee,
Jaewook Lee,
Harksoo Kim

Affiliations

Woojin Lee: Department of Artificial Intelligence, Konkuk University, Seoul, Republic of South Korea
Jaewook Lee: Department of Artificial Intelligence, Konkuk University, Seoul, Republic of South Korea
Harksoo Kim: Department of Computer Science and Engineering, Konkuk University, Seoul, Republic of South Korea

DOI: https://doi.org/10.7717/peerj-cs.2585
Journal volume & issue: Vol. 10
p. e2585

Abstract

Read online Read online

Stance detection is a critical task in natural language processing that determines an author’s viewpoint toward a specific target, playing a pivotal role in social science research and various applications. Traditional approaches incorporating Wikipedia-sourced data into small language models (SLMs) to compensate for limited target knowledge often suffer from inconsistencies in article quality and length due to the diverse pool of Wikipedia contributors. To address these limitations, we utilize large language models (LLMs) pretrained on expansive datasets to generate accurate and contextually relevant target knowledge. By providing concise, real-world insights tailored to the stance detection task, this approach surpasses the limitations of Wikipedia-based information. Despite their superior reasoning capabilities, LLMs are computationally intensive and challenging to deploy on smaller devices. To mitigate these drawbacks, we introduce a reasoning distillation methodology that transfers the reasoning capabilities of LLMs to more compact SLMs, enhancing their efficiency while maintaining robust performance. Our stance detection model, LOGIC (LLM-Originated Guidance for Internal Cognitive improvement of small language models in stance detection), is built on Bidirectional and Auto-Regressive Transformer (BART) and fine-tuned with auxiliary learning tasks, including reasoning distillation. By incorporating LLM-generated target knowledge into the inference process, LOGIC achieves state-of-the-art performance on the VAried Stance Topics (VAST) dataset, outperforming advanced models like GPT-3.5 Turbo and GPT-4 Turbo in stance detection tasks.

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/

About the journal

Abstract

Keywords