Machine learning for identification of frailty in Canadian primary care practices

Sylvia Aponte-Hao; Sabrina T. Wong; Manpreet Thandi; Paul Ronksley; Kerry  McBrien; Joon Lee; Mathew  Grandy; Dee Mangin; Alan Katz; Alexander  Singer; Donna Manca; Tyler Williamson

doi:10.23889/ijpds.v6i1.1650

International Journal of Population Data Science (Sep 2021)

Machine learning for identification of frailty in Canadian primary care practices

Sylvia Aponte-Hao,
Sabrina T. Wong,
Manpreet Thandi,
Paul Ronksley,
Kerry McBrien,
Joon Lee,
Mathew Grandy,
Dee Mangin,
Alan Katz,
Alexander Singer,
Donna Manca,
Tyler Williamson

Affiliations

Sylvia Aponte-Hao: University of Calgary, Cumming School of Medicine, 3330 Hospital Drive NW, Calgary, Alberta, T2N 4N1
Sabrina T. Wong: University of British Columbia, Centre for Health Services and Policy Research & School of Nursing, 2211 Wesbrook Mall, Vancouver, BC, V6T 2B5
Manpreet Thandi: University of British Columbia, Centre for Health Services and Policy Research & School of Nursing, 2211 Wesbrook Mall, Vancouver, BC, V6T 2B5
Paul Ronksley: University of Calgary, Cumming School of Medicine, 3330 Hospital Drive NW, Calgary, Alberta, T2N 4N1
Kerry McBrien: University of Calgary, Cumming School of Medicine, 3330 Hospital Drive NW, Calgary, Alberta, T2N 4N1
Joon Lee: University of Calgary, Cumming School of Medicine, 3330 Hospital Drive NW, Calgary, Alberta, T2N 4N1
Mathew Grandy: Department of Family Medicine, Dalhousie University, 1465 Brenton Street, Suite 402, Halifax, Nova Scotia, B3J 3T4
Dee Mangin: Department of Family Medicine, McMaster University, 1280 Main St W, Hamilton, ON, L8S 4L8
Alan Katz: College of Medicine Faculty of Health Sciences, University of Manitoba, 408-727 McDermot Ave, Winnipeg, Mb, R3E 3P5
Alexander Singer: Department of Family Medicine, University of Manitoba, 408-727 McDermot Ave, Winnipeg, Mb, R3E 3P5
Donna Manca: Department of Family Medicine, University of Alberta, 610 University Terrace, 8303 - 112 Street NW, Edmonton, Alberta, T6G 2T4
Tyler Williamson: University of Calgary, Cumming School of Medicine, 3330 Hospital Drive NW, Calgary, Alberta, T2N 4N1

DOI: https://doi.org/10.23889/ijpds.v6i1.1650
Journal volume & issue: Vol. 6, no. 1

Abstract

Read online

Introduction Frailty is a medical syndrome, commonly affecting people aged 65 years and over and is characterized by a greater risk of adverse outcomes following illness or injury. Electronic medical records contain a large amount of longitudinal data that can be used for primary care research. Machine learning can fully utilize this wide breadth of data for the detection of diseases and syndromes. The creation of a frailty case definition using machine learning may facilitate early intervention, inform advanced screening tests, and allow for surveillance. Objectives The objective of this study was to develop a validated case definition of frailty for the primary care context, using machine learning. Methods Physicians participating in the Canadian Primary Care Sentinel Surveillance Network across Canada were asked to retrospectively identify the level of frailty present in a sample of their own patients (total n = 5,466), collected from 2015-2019. Frailty levels were dichotomized using a cut-off of 5. Extracted features included previously prescribed medications, billing codes, and other routinely collected primary care data. We used eight supervised machine learning algorithms, with performance assessed using a hold-out test set. A balanced training dataset was also created by oversampling. Sensitivity analyses considered two alternative dichotomization cut-offs. Model performance was evaluated using area under the receiver-operating characteristic curve, F1, accuracy, sensitivity, specificity, negative predictive value and positive predictive value. Results The prevalence of frailty within our sample was 18.4%. Of the eight models developed to identify frail patients, an XGBoost model achieved the highest sensitivity (78.14%) and specificity (74.41%). The balanced training dataset did not improve classification performance. Sensitivity analyses did not show improved performance for cut-offs other than 5. Conclusion Supervised machine learning was able to create well performing classification models for frailty. Future research is needed to assess frailty inter-rater reliability, and link multiple data sources for frailty identification.

Published in International Journal of Population Data Science

ISSN: 2399-4908 (Online)
Publisher: Swansea University
Country of publisher: United Kingdom
LCC subjects: Social Sciences: Economic theory. Demography: Demography. Population. Vital events
Website: https://ijpds.org

About the journal