Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools

Xia Jing; James J. Cimino; Vimla L. Patel; Yuchun Zhou; Jay H. Shubrook; Sonsoles De Lacalle; Brooke N. Draghi; Mytchell A. Ernst; Aneesa Weaver; Shriram Sekar; Chang Liu

doi:10.1017/cts.2023.708

Journal of Clinical and Translational Science (Jan 2024)

Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools

Xia Jing,
James J. Cimino,
Vimla L. Patel,
Yuchun Zhou,
Jay H. Shubrook,
Sonsoles De Lacalle,
Brooke N. Draghi,
Mytchell A. Ernst,
Aneesa Weaver,
Shriram Sekar,
Chang Liu

Affiliations

Xia Jing: ORCiD; Department of Public Health Sciences, College of Behavioral, Social and Health Sciences, Clemson University, Clemson, SC, USA
James J. Cimino: ORCiD; Informatics Institute, School of Medicine, University of Alabama, Birmingham, AL, USA
Vimla L. Patel: ORCiD; Cognitive Studies in Medicine and Public Health, The New York Academy of Medicine, New York City, NY, USA
Yuchun Zhou: ORCiD; Department of Educational Studies, The Patton College of Education, Ohio University, Athens, OH, USA
Jay H. Shubrook: ORCiD; Department of Clinical Sciences and Community Health, College of Osteopathic Medicine, Touro University California, Vallejo, CA, USA
Sonsoles De Lacalle: ORCiD; Department of Health Science, California State University Channel Islands, Camarillo, CA, USA
Brooke N. Draghi: ORCiD; Department of Public Health Sciences, College of Behavioral, Social and Health Sciences, Clemson University, Clemson, SC, USA
Mytchell A. Ernst: ORCiD; Department of Public Health Sciences, College of Behavioral, Social and Health Sciences, Clemson University, Clemson, SC, USA
Aneesa Weaver: Department of Public Health Sciences, College of Behavioral, Social and Health Sciences, Clemson University, Clemson, SC, USA
Shriram Sekar: Electrical Engineering and Computer Science, Russ College of Engineering and Technology, Ohio University, Athens, OH, USA
Chang Liu: ORCiD; Russ College of Engineering and Technology, Ohio University, Athens, OH, USA

DOI: https://doi.org/10.1017/cts.2023.708
Journal volume & issue: Vol. 8

Abstract

Read online

Abstract Objectives: To compare how clinical researchers generate data-driven hypotheses with a visual interactive analytic tool (VIADS, a visual interactive analysis tool for filtering and summarizing large datasets coded with hierarchical terminologies) or other tools. Methods: We recruited clinical researchers and separated them into “experienced” and “inexperienced” groups. Participants were randomly assigned to a VIADS or control group within the groups. Each participant conducted a remote 2-hour study session for hypothesis generation with the same study facilitator on the same datasets by following a think-aloud protocol. Screen activities and audio were recorded, transcribed, coded, and analyzed. Hypotheses were evaluated by seven experts on their validity, significance, and feasibility. We conducted multilevel random effect modeling for statistical tests. Results: Eighteen participants generated 227 hypotheses, of which 147 (65%) were valid. The VIADS and control groups generated a similar number of hypotheses. The VIADS group took a significantly shorter time to generate one hypothesis (e.g., among inexperienced clinical researchers, 258 s versus 379 s, p = 0.046, power = 0.437, ICC = 0.15). The VIADS group received significantly lower ratings than the control group on feasibility and the combination rating of validity, significance, and feasibility. Conclusion: The role of VIADS in hypothesis generation seems inconclusive. The VIADS group took a significantly shorter time to generate each hypothesis. However, the combined validity, significance, and feasibility ratings of their hypotheses were significantly lower. Further characterization of hypotheses, including specifics on how they might be improved, could guide future tool development.

Published in Journal of Clinical and Translational Science

ISSN: 2059-8661 (Online)
Publisher: Cambridge University Press
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://www.cambridge.org/core/journals/journal-of-clinical-and-translational-science

About the journal

Abstract

Keywords