Mayo Clinic Proceedings: Digital Health (Sep 2024)
Automated Identification of Patients’ Unmet Social Needs in Clinical Text Using Natural Language Processing
Abstract
Objective: To develop natural language processing (NLP) solutions for identifying patients’ unmet social needs to enable timely intervention. Patients and Methods: Design: A retrospective cohort study with review and annotation of clinical notes to identify unmet social needs, followed by using the annotations to develop and evaluate NLP solutions. Participants: A total of 1103 primary care patients seen at a large academic medical center from June 1, 2019, to May 31, 2021 and referred to a community health worker (CHW) program. Clinical notes and portal messages of 200 age and sex-stratified patients were sampled for annotation of unmet social needs. Systems: Two NLP solutions were developed and compared. The first solution employed similarity-based classification on top of sentences represented as semantic embedding vectors. The second solution involved designing of terms and patterns for identifying each domain of unmet social needs in the clinical text. Measures: Precision, recall, and f1-score of the NLP solutions. Results: A total of 5675 clinical notes and 475 portal messages were annotated, with an inter-annotator agreement of 0.938. The best NLP solution achieved an f1-score of 0.95 and was applied to the entire CHW-referred cohort (n=1103), of whom >80% had at least 1 unmet social need within the 6 months before the first CHW referral. Financial strain and health literacy were the top 2 domains of unmet social needs across most of the sex and age strata. Conclusion: Clinical text contains rich information about patients’ unmet social needs. The NLP can achieve good performance in identifying those needs for CHW referral and facilitate data-driven research on social determinants of health.