Children (Dec 2022)

Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information

  • Yanhong Jessika Hu,
  • Anna Fedyukova,
  • Jing Wang,
  • Joanne M. Said,
  • Niranjan Thomas,
  • Elizabeth Noble,
  • Jeanie L. Y. Cheong,
  • Bill Karanatsios,
  • Sharon Goldfeld,
  • Melissa Wake

DOI
https://doi.org/10.3390/children9121916
Journal volume & issue
Vol. 9, no. 12
p. 1916

Abstract

Linking very large, consented birth cohorts to birthing hospitals' clinical data could elucidate the lifecourse outcomes of health care and exposures during the pregnancy, birth and newborn periods. Unfortunately, cohort personally identifiable information (PII) often does not include unique identifier numbers, which makes matching difficult. To develop optimized matching of cohort participants to birthing hospital clinical records, this pilot drew on a one-year (December 2020–December 2021) cohort from a single Australian birthing hospital participating in the whole-of-state Generation Victoria (GenV) study. For 1819 consented mother-baby pairs and 58 additional babies (whose mothers were not themselves participating), we tested the accuracy and effort of various approaches to matching. We selected demographic variables drawn from names, date of birth (DOB), sex, telephone and address (and birth order for multiple births). After variable standardization and validation, accuracy rose from 10% to 99% using a deterministic rule-based approach in 10 steps. Using cohort-specific modifications of the Australian Statistical Linkage Key (SLK-581), it took only 3 steps to reach 97% (SLK-5881) and 98% (SLK-5881.1) accuracy. We conclude that our SLK-5881 process could safely and efficiently achieve high accuracy at the population level for future birth cohort-birth hospital matching in the absence of unique identifier numbers.
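
For readers unfamiliar with the base key, the following is a minimal sketch of the standard SLK-581 as it is commonly specified: the 2nd, 3rd and 5th letters of the family name, the 2nd and 3rd letters of the given name, the date of birth as DDMMYYYY, and a one-digit sex code. It is not the authors' SLK-5881 or SLK-5881.1 variant, whose modifications the abstract does not describe. The function names `slk581` and `_pick` are hypothetical, and the padding rules (`2` for a missing letter, `9` for a missing name) are assumed from the common specification.

```python
import re
from datetime import date


def _pick(name, positions):
    """Return the letters of `name` at the given 1-based positions.

    Assumed padding rules: non-letters are ignored, a position beyond
    the end of the name yields '2', and an entirely missing name
    yields '9' for every position.
    """
    letters = re.sub(r"[^A-Za-z]", "", name or "").upper()
    if not letters:
        return "9" * len(positions)
    return "".join(
        letters[p - 1] if p <= len(letters) else "2" for p in positions
    )


def slk581(family_name, given_name, dob, sex_code):
    """Build a 14-character SLK-581 style key (illustrative only).

    sex_code is passed through as a single character, for example
    '1' for male, '2' for female, '9' for not stated.
    """
    return (
        _pick(family_name, (2, 3, 5))   # 3 letters from family name
        + _pick(given_name, (2, 3))     # 2 letters from given name
        + dob.strftime("%d%m%Y")        # 8 digits: DDMMYYYY
        + sex_code                      # 1 digit
    )


print(slk581("Smith", "Jane", date(2021, 3, 5), "2"))
```

Because the key depends only on a few letters, the DOB and sex, two records can agree on it despite minor spelling differences elsewhere in the name, which is why the standardization and validation of those input fields matters for matching accuracy.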

Keywords