npj Digital Medicine (Mar 2025)

Natural language processing of electronic health records for early detection of cognitive decline: a systematic review

  • Ravi Shankar,
  • Anjali Bundele,
  • Amartya Mukhopadhyay

DOI
https://doi.org/10.1038/s41746-025-01527-z
Journal volume & issue
Vol. 8, no. 1
pp. 1 – 10

Abstract

Read online

Abstract This systematic review evaluated natural language processing (NLP) approaches for detecting cognitive impairment in electronic health record clinical notes. Following PRISMA guidelines, we analyzed 18 studies (n = 1,064,530) that employed rule-based algorithms (67%), traditional machine learning (28%), and deep learning (17%). NLP models demonstrated robust performance in identifying cognitive decline, with median sensitivity 0.88 (IQR 0.74–0.91) and specificity 0.96 (IQR 0.81–0.99). Deep learning architectures achieved superior results, with area under the receiver operating characteristic curves up to 0.997. Major implementation challenges included incomplete electronic health record data capture, inconsistent clinical documentation practices, and limited external validation. While NLP demonstrates promise, successful clinical translation requires establishing standardized approaches, improving access to annotated datasets, and developing equitable deployment frameworks.