BMC Medical Informatics and Decision Making (Aug 2022)

Leveraging artificial intelligence and data science techniques in harmonizing, sharing, accessing and analyzing SARS-COV-2/COVID-19 data in Rwanda (LAISDAR Project): study design and rationale

  • Aurore Nishimwe,
  • Charles Ruranga,
  • Clarisse Musanabaganwa,
  • Regine Mugeni,
  • Muhammed Semakula,
  • Joseph Nzabanita,
  • Ignace Kabano,
  • Annie Uwimana,
  • Jean N. Utumatwishima,
  • Jean Damascene Kabakambira,
  • Annette Uwineza,
  • Lars Halvorsen,
  • Freija Descamps,
  • Jared Houghtaling,
  • Benjamin Burke,
  • Odile Bahati,
  • Clement Bizimana,
  • Stefan Jansen,
  • Celestin Twizere,
  • Kizito Nkurikiyeyezu,
  • Francine Birungi,
  • Sabin Nsanzimana,
  • Marc Twagirumukiza

DOI
https://doi.org/10.1186/s12911-022-01965-9
Journal volume & issue
Vol. 22, no. 1
pp. 1 – 9

Abstract

Read online

Abstract Background Since the outbreak of COVID-19 pandemic in Rwanda, a vast amount of SARS-COV-2/COVID-19-related data have been collected including COVID-19 testing and hospital routine care data. Unfortunately, those data are fragmented in silos with different data structures or formats and cannot be used to improve understanding of the disease, monitor its progress, and generate evidence to guide prevention measures. The objective of this project is to leverage the artificial intelligence (AI) and data science techniques in harmonizing datasets to support Rwandan government needs in monitoring and predicting the COVID-19 burden, including the hospital admissions and overall infection rates. Methods The project will gather the existing data including hospital electronic health records (EHRs), the COVID-19 testing data and will link with longitudinal data from community surveys. The open-source tools from Observational Health Data Sciences and Informatics (OHDSI) will be used to harmonize hospital EHRs through the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). The project will also leverage other OHDSI tools for data analytics and network integration, as well as R Studio and Python. The network will include up to 15 health facilities in Rwanda, whose EHR data will be harmonized to OMOP CDM. Expected results This study will yield a technical infrastructure where the 15 participating hospitals and health centres will have EHR data in OMOP CDM format on a local Mac Mini (“data node”), together with a set of OHDSI open-source tools. A central server, or portal, will contain a data catalogue of participating sites, as well as the OHDSI tools that are used to define and manage distributed studies. The central server will also integrate the information from the national Covid-19 registry, as well as the results of the community surveys. The ultimate project outcome is the dynamic prediction modelling for COVID-19 pandemic in Rwanda. Discussion The project is the first on the African continent leveraging AI and implementation of an OMOP CDM based federated data network for data harmonization. Such infrastructure is scalable for other pandemics monitoring, outcomes predictions, and tailored response planning.

Keywords