Impact of data source choice on multimorbidity measurement: a comparison study of 2.3 million individuals in the Welsh National Health Service

Clare MacRae; Daniel Morales; Stewart W. Mercer; Nazir Lone; Andrew Lawson; Emily Jefferson; David McAllister; Marjan van den Akker; Alan Marshall; Sohan Seth; Anna Rawlings; Jane Lyons; Ronan A. Lyons; Amy Mizen; Eleojo Abubakar; Chris Dibben; Bruce Guthrie

doi:10.1186/s12916-023-02970-z

BMC Medicine (Aug 2023)

Affiliations

Clare MacRae: Advanced Care Research Centre, University of Edinburgh
Daniel Morales: Division of Population Health and Genomics, University of Dundee
Stewart W. Mercer: Advanced Care Research Centre, University of Edinburgh
Nazir Lone: Usher Institute, College of Medicine and Veterinary Medicine, University of Edinburgh
Andrew Lawson: Usher Institute, College of Medicine and Veterinary Medicine, University of Edinburgh
Emily Jefferson: Division of Population Health and Genomics, University of Dundee
David McAllister: Public Health, Institute of Health and Wellbeing, University of Glasgow
Marjan van den Akker: Institute of General Practice, Goethe University Frankfurt
Alan Marshall: School of Social and Political Science, University of Edinburgh, Chrystal Macmillan Building
Sohan Seth: School of Informatics, The University of Edinburgh
Anna Rawlings: Swansea University Medical School, Data Science Building, Singleton Campus
Jane Lyons: Swansea University Medical School, Data Science Building, Singleton Campus
Ronan A. Lyons: Swansea University Medical School, Data Science Building, Singleton Campus
Amy Mizen: Swansea University Medical School, Data Science Building, Singleton Campus
Eleojo Abubakar: Public Health, Institute of Health and Wellbeing, University of Glasgow
Chris Dibben: Institute of Geography, University of Edinburgh
Bruce Guthrie: Advanced Care Research Centre, University of Edinburgh

Journal volume & issue: Vol. 21, no. 1, pp. 1–12

Abstract

Background Measurement of multimorbidity in research is variable, including the choice of the data source used to ascertain conditions. We compared the estimated prevalence of multimorbidity and associations with mortality using different data sources.
Methods We conducted a cross-sectional study using SAIL Databank data on 2,340,027 individuals of all ages living in Wales on 01 January 2019. We compared the prevalence of multimorbidity and its 47 constituent conditions using data from primary care (PC), hospital inpatient (HI), and linked PC-HI data sources, and examined associations between condition count and 12-month mortality.
Results Using linked PC-HI compared with only HI data, multimorbidity was more prevalent (32.2% versus 16.5%), and the population of people identified as having multimorbidity was younger (mean age 62.5 versus 66.8 years) and included more women (54.2% versus 52.6%). Individuals with multimorbidity in both PC and HI data had stronger associations with mortality than those with multimorbidity only in HI data (adjusted odds ratio 8.34 [95% CI 8.02–8.68] versus 6.95 [95% CI 6.79–7.12] in people with ≥ 4 conditions). The prevalence of conditions identified using only PC versus only HI data was significantly higher for 37/47 and significantly lower for 10/47: the highest PC/HI ratio was for depression (14.2 [95% CI 14.1–14.4]) and the lowest for aneurysm (0.51 [95% CI 0.5–0.5]). Agreement in ascertainment of conditions between the two data sources varied considerably, being slight for five (kappa < 0.20), fair for 12 (kappa 0.21–0.40), moderate for 16 (kappa 0.41–0.60), and substantial for 12 (kappa 0.61–0.80) conditions, and by body system was lowest for mental and behavioural disorders. Percentage agreement (individuals with a condition identified in both PC and HI data) was lowest for anxiety (4.6%) and highest for coronary artery disease (62.9%).
Conclusions The use of single data sources may underestimate prevalence when measuring multimorbidity and many important conditions (especially mental and behavioural disorders). Caution should be used when interpreting findings of research examining individual and multiple long-term conditions using single data sources. Where available, researchers using electronic health data should link primary care and hospital inpatient data to generate more robust evidence to support evidence-based healthcare planning decisions for people with multimorbidity.
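
As a rough illustration of the agreement measures named in the abstract, the sketch below computes Cohen's kappa, a PC/HI prevalence ratio, and a percentage overlap from a single 2×2 cross-tabulation of one condition. It is not the authors' code: the class and function names are hypothetical, the counts are invented for illustration, and the denominator used for the overlap percentage (individuals with the condition in either source) is an assumption, since the abstract does not state it.

```python
from dataclasses import dataclass


@dataclass
class TwoByTwo:
    """Hypothetical cross-tabulation of one condition across two sources."""
    both: int      # recorded in primary care (PC) and hospital inpatient (HI) data
    pc_only: int   # recorded in PC data only
    hi_only: int   # recorded in HI data only
    neither: int   # recorded in neither source


def cohen_kappa(t: TwoByTwo) -> float:
    """Chance-corrected agreement between PC and HI ascertainment."""
    n = t.both + t.pc_only + t.hi_only + t.neither
    observed = (t.both + t.neither) / n
    pc_yes = (t.both + t.pc_only) / n
    hi_yes = (t.both + t.hi_only) / n
    # Agreement expected by chance if the two sources were independent.
    expected = pc_yes * hi_yes + (1 - pc_yes) * (1 - hi_yes)
    return (observed - expected) / (1 - expected)


def prevalence_ratio(t: TwoByTwo) -> float:
    """Prevalence using PC data alone divided by prevalence using HI data alone."""
    return (t.both + t.pc_only) / (t.both + t.hi_only)


def overlap_percent(t: TwoByTwo) -> float:
    """Percent found in both sources, out of those found in either (assumed denominator)."""
    return 100 * t.both / (t.both + t.pc_only + t.hi_only)


if __name__ == "__main__":
    # Invented counts for a condition recorded mostly in primary care.
    example = TwoByTwo(both=20, pc_only=60, hi_only=5, neither=915)
    print(f"kappa = {cohen_kappa(example):.2f}")
    print(f"PC/HI prevalence ratio = {prevalence_ratio(example):.2f}")
    print(f"overlap = {overlap_percent(example):.1f}%")
```

A ratio above 1 with a low kappa, as in this invented example, is the pattern the abstract describes for conditions that are mostly managed in primary care and so are rarely recorded in hospital data.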

Published in BMC Medicine

ISSN: 1741-7015 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: http://bmcmedicine.biomedcentral.com
