An operational guide to translational clinical machine learning in academic medical centers

Mukund Poddar; Jayson S. Marwaha; William Yuan; Santiago Romero-Brufau; Gabriel A. Brat

doi:10.1038/s41746-024-01094-9

npj Digital Medicine (May 2024)

An operational guide to translational clinical machine learning in academic medical centers

Mukund Poddar,
Jayson S. Marwaha,
William Yuan,
Santiago Romero-Brufau,
Gabriel A. Brat

Affiliations

Mukund Poddar: Department of Biostatistics, Harvard T. H. Chan School of Public Health
Jayson S. Marwaha: Department of Surgery, Beth Israel Deaconess Medical Center
William Yuan: Department of Surgery, Beth Israel Deaconess Medical Center
Santiago Romero-Brufau: Department of Biostatistics, Harvard T. H. Chan School of Public Health
Gabriel A. Brat: Department of Surgery, Beth Israel Deaconess Medical Center

DOI: https://doi.org/10.1038/s41746-024-01094-9
Journal volume & issue: Vol. 7, no. 1
pp. 1 – 7

Abstract

Read online

Abstract Few published data science tools are ever translated from academia to real-world clinical settings for which they were intended. One dimension of this problem is the software engineering task of turning published academic projects into tools that are usable at the bedside. Given the complexity of the data ecosystem in large health systems, this task often represents a significant barrier to the real-world deployment of data science tools for prospective piloting and evaluation. Many information technology companies have created Machine Learning Operations (MLOps) teams to help with such tasks at scale, but the low penetration of home-grown data science tools in regular clinical practice precludes the formation of such teams in healthcare organizations. Based on experiences deploying data science tools at two large academic medical centers (Beth Israel Deaconess Medical Center, Boston, MA; Mayo Clinic, Rochester, MN), we propose a strategy to facilitate this transition from academic product to operational tool, defining the responsibilities of the principal investigator, data scientist, machine learning engineer, health system IT administrator, and clinician end-user throughout the process. We first enumerate the technical resources and stakeholders needed to prepare for model deployment. We then propose an approach to planning how the final product will work from data extraction and analysis to visualization of model outputs. Finally, we describe how the team should execute on this plan. We hope to guide health systems aiming to deploy minimum viable data science tools and realize their value in clinical practice.

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/

About the journal