IEEE Transactions on Neural Systems and Rehabilitation Engineering (Jan 2024)
Integrating Large Language Model, EEG, and Eye-Tracking for Word-Level Neural State Classification in Reading Comprehension
Abstract
With the recent proliferation of large language models (LLMs), such as the Generative Pre-trained Transformer (GPT), there has been a significant shift in exploring human and machine comprehension of semantic language meaning. This shift calls for interdisciplinary research that bridges cognitive science and natural language processing (NLP). This pilot study aims to provide insights into individuals’ neural states during a semantic inference reading-comprehension task. We propose jointly analyzing LLM output, eye-gaze data, and electroencephalographic (EEG) recordings to study how the brain processes words with varying degrees of relevance to a keyword during reading. We also apply feature engineering to improve classification of fixation-related EEG data recorded while participants read words of high versus low relevance to the keyword. The best validation accuracy for this word-level classification exceeds 60% across 12 subjects. Words highly relevant to the inference keyword received significantly more eye fixations per word (1.0584 versus 0.6576), with both averages including words that received no fixations. This study represents the first attempt to classify brain states at the word level using LLM-generated labels. It provides valuable insights into human cognitive abilities and Artificial General Intelligence (AGI), and offers guidance for developing potential reading-assistance technologies.
Keywords