Could dementia be detected from UK primary care patients’ records by simple automated methods earlier than by the treating physician? A retrospective case-control study [version 1; peer review: 2 approved]

Elizabeth Ford; Johannes Starlinger; Philip Rooney; Seb Oliver; Sube Banerjee; Harm van Marwijk; Jackie Cassell

doi:10.12688/wellcomeopenres.15903.1

Wellcome Open Research (Jun 2020)

Could dementia be detected from UK primary care patients’ records by simple automated methods earlier than by the treating physician? A retrospective case-control study [version 1; peer review: 2 approved]

Elizabeth Ford,
Johannes Starlinger,
Philip Rooney,
Seb Oliver,
Sube Banerjee,
Harm van Marwijk,
Jackie Cassell

Affiliations

Elizabeth Ford: Department of Primary Care and Public Health, Brighton and Sussex Medical School, Watson Building, Village Way, Falmer, Brighton, BN1 9PH, UK
Johannes Starlinger: Department of Computer Science, Humboldt University of Berlin, Rudower Chaussee 25, Berlin, 12489, Germany
Philip Rooney: Department of Physics and Astronomy, University of Sussex, Brighton, BN1 9RQ, UK
Seb Oliver: Department of Physics and Astronomy, University of Sussex, Brighton, BN1 9RQ, UK
Sube Banerjee: Faculty of Health, University of Plymouth, Drake Circus, Plymouth, Devon, PL4 8AA, UK
Harm van Marwijk: Department of Primary Care and Public Health, Brighton and Sussex Medical School, Watson Building, Village Way, Falmer, Brighton, BN1 9PH, UK
Jackie Cassell: Department of Primary Care and Public Health, Brighton and Sussex Medical School, Watson Building, Village Way, Falmer, Brighton, BN1 9PH, UK

DOI: https://doi.org/10.12688/wellcomeopenres.15903.1
Journal volume & issue: Vol. 5

Abstract

Read online

Background: Timely diagnosis of dementia is a policy priority in the United Kingdom (UK). Primary care physicians receive incentives to diagnose dementia; however, 33% of patients are still not receiving a diagnosis. We explored automating early detection of dementia using data from patients’ electronic health records (EHRs). We investigated: a) how early a machine-learning model could accurately identify dementia before the physician; b) if models could be tuned for dementia subtype; and c) what the best clinical features were for achieving detection. Methods: Using EHRs from Clinical Practice Research Datalink in a case-control design, we selected patients aged >65y with a diagnosis of dementia recorded 2000-2012 (cases) and matched them 1:1 to controls; we also identified subsets of Alzheimer’s and vascular dementia patients. Using 77 coded concepts recorded in the 5 years before diagnosis, we trained random forest classifiers, and evaluated models using Area Under the Receiver Operating Characteristic Curve (AUC). We examined models by year prior to diagnosis, subtype, and the most important features contributing to classification. Results: 95,202 patients (median age 83y; 64.8% female) were included (50% dementia cases). Classification of dementia cases and controls was poor 2-5 years prior to physician-recorded diagnosis (AUC range 0.55-0.65) but good in the year before (AUC: 0.84). Features indicating increasing cognitive and physical frailty dominated models 2-5 years before diagnosis; in the final year, initiation of the dementia diagnostic pathway (symptoms, screening and referral) explained the sudden increase in accuracy. No substantial differences were seen between all-cause dementia and subtypes. Conclusions: Automated detection of dementia earlier than the treating physician may be problematic, if using only primary care data. Future work should investigate more complex modelling, benefits of linking multiple sources of healthcare data and monitoring devices, or contextualising the algorithm to those cases that the GP would need to investigate.

Published in Wellcome Open Research

ISSN: 2398-502X (Online)
Publisher: Wellcome
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://wellcomeopenresearch.org/

About the journal