Data-Centric Engineering (Jan 2023)

Embedding data science innovations in organizations: a new workflow approach

  • Keyao Li,
  • Mark A. Griffin,
  • Tamryn Barker,
  • Zane Prickett,
  • Melinda R. Hodkiewicz,
  • Jess Kozman,
  • Peta Chirgwin

DOI
https://doi.org/10.1017/dce.2023.22
Journal volume & issue
Vol. 4

Abstract

There have been consistent calls for more research on managing teams and embedding processes in data science innovations. Widely used frameworks (e.g., the cross-industry standard process for data mining) provide a standardized approach to data science but are limited in features such as role clarity, skills, and cross-team collaboration, which are essential for developing organizational capabilities in data science. In this study, we introduce a data workflow method (DWM) as a new approach to break down organizational silos and create a multi-disciplinary team to develop, implement, and embed data science. Unlike current data science process workflows, the DWM is managed at the system level, where it shapes the business operating model for continuous improvement, rather than as a function of a particular project, a single business unit, or isolated individuals. To further operationalize the DWM approach, we investigated an embedded data workflow at a mining operation that has been using geological data in a machine-learning model to stabilize daily mill production for the last 2 years. Based on the findings of this study, we propose that the DWM approach derives its capability from three aspects: (a) a systemic data workflow; (b) multi-disciplinary networks of collaboration and responsibility; and (c) clearly identified data roles and the associated skills and expertise. This study suggests a whole-of-organization approach and pathway for developing data science capability.

Keywords