The Lancet Regional Health. Americas (Feb 2022)

Factors associated with SARS-CoV-2 test positivity in long-term care homes: A population-based cohort analysis using machine learning

  • Douglas S. Lee,
  • Chloe X. Wang,
  • Finlay A. McAlister,
  • Shihao Ma,
  • Anna Chu,
  • Paula A. Rochon,
  • Padma Kaul,
  • Peter C. Austin,
  • Xuesong Wang,
  • Sunil V. Kalmady,
  • Jacob A. Udell,
  • Michael J. Schull,
  • Barry B. Rubin,
  • Bo Wang

Journal volume & issue
Vol. 6
p. 100146

Abstract

Read online

Summary: Background: SARS-Cov-2 infection rates are high among residents of long-term care (LTC) homes. We used machine learning to identify resident and community characteristics predictive of SARS-Cov-2 infection. Methods: We linked 26 population-based health and administrative databases to identify the population of all LTC residents tested for SARS-Cov-2 infection in Ontario, Canada. Using ensemble-based algorithms, we examined 484 factors, including individual-level demographics, healthcare use, comorbidities, functional status, and laboratory results; and community-level characteristics to identify factors predictive of infection. Analyses were performed separately for January to April (early wave 1) and May to August (late wave 1). Findings: Among 80,784 LTC residents, 64,757 (80.2%) were tested for SARS-Cov-2 (median age 86 (78–91) years, 30.6% male), of whom 10.2% of 33,519 and 5.2% of 31,238 tested positive in early and late wave 1, respectively. In the late phase (when restriction of visitors, closure of communal spaces, and universal masking in LTC were routine), regional-level characteristics comprised 33 of the top 50 factors associated with testing positive, while laboratory values and comorbidities were also predictive. The c-index of the final model was 0.934, and sensitivity was 0.887. In the highest versus lowest risk quartiles, the odds ratio for infection was 114.3 (95% CI 38.6–557.3). LTC-related geographic variations existed in the distribution of observed infection rates and the proportion of residents at highest risk. Interpretation: Machine learning informed evaluation of predicted and observed risks of SARS-CoV-2 infection at the resident and LTC levels, and may inform initiatives to improve care quality in this setting. Funding: Funded by a Canadian Institutes of Health Research, COVID-19 Rapid Research Funding Opportunity grant (# VR4 172736) and a Peter Munk Cardiac Centre Innovation Grant. Dr. D. Lee is the Ted Rogers Chair in Heart Function Outcomes, University Health Network, University of Toronto. Dr. Austin is supported by a Mid-Career investigator award from the Heart and Stroke Foundation. Dr. McAlister is supported by an Alberta Health Services Chair in Cardiovascular Outcomes Research. Dr. Kaul is the CIHR Sex and Gender Science Chair and the Heart & Stroke Chair in Cardiovascular Research. Dr. Rochon holds the RTO/ERO Chair in Geriatric Medicine from the University of Toronto. Dr. B. Wang holds a CIFAR AI chair at the Vector Institute.

Keywords