Journal of Statistics and Data Science Education (Jan 2025)

Generative AI for Data Science 101: Coding Without Learning to Code

  • Jacob Bien
  • Gourab Mukherjee

DOI
https://doi.org/10.1080/26939169.2024.2432397

Abstract

Should one teach coding in a required introductory statistics and data science class for non-major students? Many professors advise against it, considering it a distraction from the important and challenging statistical topics that need to be covered. By contrast, other professors argue that the ability to interact flexibly with data will inspire students with a lasting love of the subject and a continued commitment to the material beyond the introductory course. With the release of large language models that write code, we saw an opportunity for a middle ground, which we tried in Fall 2023 in a required introductory data science course in our school’s full-time MBA program. We taught students to write English prompts that the artificial intelligence tool GitHub Copilot could turn into R code to be executed. In this short article, we report on our experience using this new approach.

Keywords