International Journal of Population Data Science (Sep 2024)

Mother and Infant Research Electronic Data Analysis (MIREDA): A protocol for creating a common data model for federated analysis of UK birth cohorts and the life course

  • Michael Seaborne,
  • Hope Jones,
  • Neil Cockburn,
  • Stevo Durbaba,
  • Arturo González-Izquierdo,
  • Amy Hough,
  • Dan Mason,
  • Carlos Sánchez-Soriano,
  • Chris Orton,
  • Armando Méndez-Villalon,
  • Tom Giles,
  • David Ford,
  • Phillip Quinlan,
  • Krish Nirantharakumar,
  • Lucilla Poston,
  • Rebecca Reynolds,
  • Gillian Santorelli,
  • Sinead Brophy

DOI
https://doi.org/10.23889/ijpds.v9i2.2406
Journal volume & issue
Vol. 9, no. 2

Abstract

Read online

Introduction Birth cohorts are valuable resources for studying early life, the determinants of health, disease, and development. They are essential for studying life course. Dynamic longitudinal electronic cohorts use routinely collected data, are live, and can reduce selection bias specifically associated with direct recruitment in traditional birth cohorts. However, they are limited to health and administrative data and may lack contextual information. The MIREDA (Mother and Infant Research Electronic Data Analysis) partnership creates a UK-wide birth cohort by aligning existing electronic birth cohorts to have the same structure, content, and vocabularies, enabling UK-wide federated analyses. Objectives 1) Create a core dynamic, live UK-wide electronic birth cohort with approximately 500,000 new births per year using a common data model (CDM). 2) Provide data linkage and automation for long-term follow up of births from the Clinical Practice Research Datalink (CPRD), MuM-PreDiCT and the `Born in' initiatives of Bradford, Wales, Scotland, and South London for comparable analyses. Methods We will establish core data content and collate linkable data. A suite of extraction, transformation, and load (ETL) tools will be used to transform data for each birth cohort into the CDM. Transformed datasets will remain within each cohort's trusted research environment (TRE). Metadata will be uploaded for the public to the Health Data Research (HDRUK) Innovation Gateway. We will develop a single online data access request for researchers. A cohort profile will be developed for researchers to reference the resource. Ethics Each cohort has approval from their TRE through compliance with their project application processes and information governance. Dissemination We will engage with researchers in the field to promote our resource through partnership networking, publication, research collaborations, conferences, social media, and marketing communications strategies.

Keywords