Development and validation of a novel AI framework using NLP with LLM integration for relevant clinical data extraction through automated chart review

Mert Marcel Dagli; Yohannes Ghenbot; Hasan S. Ahmad; Daksh Chauhan; Ryan Turlip; Patrick Wang; William C. Welch; Ali K. Ozturk; Jang W Yoon

doi:10.1038/s41598-024-77535-y

Scientific Reports (Nov 2024)

Development and validation of a novel AI framework using NLP with LLM integration for relevant clinical data extraction through automated chart review

Mert Marcel Dagli,
Yohannes Ghenbot,
Hasan S. Ahmad,
Daksh Chauhan,
Ryan Turlip,
Patrick Wang,
William C. Welch,
Ali K. Ozturk,
Jang W Yoon

Affiliations

Mert Marcel Dagli: Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania
Yohannes Ghenbot: Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania
Hasan S. Ahmad: Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania
Daksh Chauhan: Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania
Ryan Turlip: Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania
Patrick Wang: Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania
William C. Welch: Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania
Ali K. Ozturk: Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania
Jang W Yoon: Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania

DOI: https://doi.org/10.1038/s41598-024-77535-y
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 7

Abstract

Read online

Abstract The accurate extraction of surgical data from electronic health records (EHRs), particularly operative notes through manual chart review (MCR), is complex, crucial, and time-intensive, limited by human error due to fatigue and the level of training. This study aimed to develop and validate a novel Natural Language Processing (NLP) algorithm integrated with a Large Language Model (LLM; GPT4-Turbo) to automate the extraction of spinal surgery data from EHRs. The algorithm employed a two-stage approach. Initially, a rule-based NLP framework reviewed and classified candidate segments from the text, preserving their reference segments. These segments were then verified in the second stage through the LLM. The primary outcomes of this study were the accurate extraction of surgical data, including the type of surgery, levels operated, number of disks removed, and presence of intraoperative incidental durotomies. Secondary objectives explored time efficiency, tokenization lengths, and costs. The performance of the algorithm was assessed across two validation databases, analyzing metrics such as accuracy, sensitivity, discrimination, F1-score, and precision, with 95% confidence intervals calculated using percentile-based bootstrapping. The NLP + LLM algorithm markedly outperformed all performance metrics, demonstrating significant improvements in time and cost efficiency. These results suggest the potential for widespread adoption of this technology.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords