Automated machine learning with interpretation: A systematic review of methodologies and applications in healthcare

Han Yuan; Kunyu Yu; Feng Xie; Mingxuan Liu; Shenghuan Sun

doi:10.1002/med4.75

Medicine Advances (Sep 2024)

Automated machine learning with interpretation: A systematic review of methodologies and applications in healthcare

Han Yuan,
Kunyu Yu,
Feng Xie,
Mingxuan Liu,
Shenghuan Sun

Affiliations

Han Yuan: Duke‐NUS Medical School Centre for Quantitative Medicine Singapore Singapore
Kunyu Yu: Duke‐NUS Medical School Centre for Quantitative Medicine Singapore Singapore
Feng Xie: Duke‐NUS Medical School Centre for Quantitative Medicine Singapore Singapore
Mingxuan Liu: Duke‐NUS Medical School Centre for Quantitative Medicine Singapore Singapore
Shenghuan Sun: Bakar Computational Health Sciences Institute University of California San Francisco California USA

DOI: https://doi.org/10.1002/med4.75
Journal volume & issue: Vol. 2, no. 3
pp. 205 – 237

Abstract

Read online

Abstract Machine learning (ML) has achieved substantial success in performing healthcare tasks in which the configuration of every part of the ML pipeline relies heavily on technical knowledge. To help professionals with borderline expertise to better use ML techniques, Automated ML (AutoML) has emerged as a prospective solution. However, most models generated by AutoML are black boxes that are challenging to comprehend and deploy in healthcare settings. We conducted a systematic review to examine AutoML with interpretation systems for healthcare. We searched four databases (MEDLINE, EMBASE, Web of Science, and Scopus) complemented with seven prestigious ML conferences (AAAI, ACL, ICLR, ICML, IJCAI, KDD, and NeurIPS) that reported AutoML with interpretation for healthcare before September 1, 2023. We included 118 articles related to AutoML with interpretation in healthcare. First, we illustrated AutoML techniques used in the included publications, including automated data preparation, automated feature engineering, and automated model development, accompanied by a real‐world case study to demonstrate the advantages of AutoML over classic ML. Then, we summarized interpretation methods: feature interaction and importance, data dimensionality reduction, intrinsically interpretable models, and knowledge distillation and rule extraction. Finally, we detailed how AutoML with interpretation has been used for six major data types: image, free text, tabular data, signal, genomic sequences, and multi‐modality. To some extent, AutoML with interpretation provides effortless development and improves users' trust in ML in healthcare settings. In future studies, researchers should explore automated data preparation, seamless integration of automation and interpretation, compatibility with multi‐modality, and utilization of foundation models.

Published in Medicine Advances

ISSN: 2834-4391 (Print); 2834-4405 (Online)
Publisher: Wiley
Country of publisher: Australia
LCC subjects: Medicine
Website: https://onlinelibrary.wiley.com/journal/28344405

About the journal

Abstract

Keywords