Informatics in Medicine Unlocked (Jan 2022)

Use of automatic SQL generation interface to enhance transparency and validity of health-data analysis

  • Kavishwar B. Wagholikar,
  • David Zelle,
  • Layne Ainsworth,
  • Kira Chaney,
  • Alexander J. Blood,
  • Angela Miller,
  • Rupendra Chulyadyo,
  • Michael Oates,
  • William J. Gordon,
  • Samuel J. Aronson,
  • Benjamin M. Scirica,
  • Shawn N. Murphy

Journal volume & issue
Vol. 31
p. 100996

Abstract

Read online

Analysis of health data typically requires development of queries using structured query language (SQL) by a data-analyst. As the SQL queries are manually created, they are prone to errors. In addition, accurate implementation of the queries depends on effective communication with clinical experts, that further makes the analysis error prone. As a potential resolution, we explore an alternative approach wherein a graphical interface that automatically generates the SQL queries is used to perform the analysis. The latter allows clinical experts to directly perform complex queries on the data, despite their unfamiliarity with SQL syntax. The interface provides an intuitive understanding of the query logic which makes the analysis transparent and comprehensible to the clinical study-staff, thereby enhancing the transparency and validity of the analysis. This study demonstrates the feasibility of using a user-friendly interface that automatically generate SQL for analysis of health data. It outlines challenges that will be useful for designing user-friendly tools to improve transparency and reproducibility of data analysis.

Keywords