Journal of Statistics and Data Science Education (May 2025)
A Systematic Literature Review of Undergraduate Data Science Education Research
Abstract
The presence of data science has been profound in the scientific community in almost every discipline. An important part of the data science education expansion has been at the undergraduate level. We conducted a systematic literature review to (a) portray current evidence and knowledge gaps in self-proclaimed undergraduate data science education research and (b) inform policymakers and the data science education community about what educators may encounter when searching for literature using the general keyword “data science education.” While open-access publications that target a broader audience of data science educators and include multiple examples of data science programs and courses are a strength, substantial knowledge gaps remain. The undergraduate data science literature that we identified often lacks empirical data, research questions, and reproducibility. Certain disciplines are less visible. We recommend that we should (a) cherish data science as an interdisciplinary field; (b) adopt a consistent set of keywords/terminology to ensure data science education literature is easily identifiable; (c) prioritize investments in empirical studies.
Keywords