Applying Self-Supervised Learning to Medicine: Review of the State of the Art and Medical Implementations

Alexander Chowdhury; Jacob Rosenthal; Jonathan Waring; Renato Umeton

doi:10.3390/informatics8030059

Informatics (Sep 2021)

Affiliations

Alexander Chowdhury: Department of Informatics & Analytics, Dana-Farber Cancer Institute, Boston, MA 02215, USA
Jacob Rosenthal: Department of Informatics & Analytics, Dana-Farber Cancer Institute, Boston, MA 02215, USA
Jonathan Waring: Department of Informatics & Analytics, Dana-Farber Cancer Institute, Boston, MA 02215, USA
Renato Umeton: Department of Informatics & Analytics, Dana-Farber Cancer Institute, Boston, MA 02215, USA

DOI: https://doi.org/10.3390/informatics8030059
Journal volume & issue: Vol. 8, no. 3, p. 59

Abstract

Machine learning has become an increasingly ubiquitous technology as big data continues to inform and influence everyday life and decision-making. Currently, in medicine and healthcare, as in most other industries, the two most prevalent machine learning paradigms are supervised learning and transfer learning. Both rely on large-scale, manually annotated datasets to train increasingly complex models. However, the requirement that data be manually labeled leaves an excess of unused, unlabeled data in both public and private data repositories. Self-supervised learning (SSL) is a growing area of machine learning that can take advantage of unlabeled data. Unlike other machine learning paradigms, SSL algorithms create artificial supervisory signals from the unlabeled data itself and pretrain models on these signals. The aim of this review is two-fold. First, we provide a formal definition of SSL, divide SSL algorithms into four distinct subsets, and review the state of the art published in each of those subsets between 2014 and 2020. Second, we survey recent SSL algorithms published in healthcare, in order to give medical experts a clearer picture of how they can integrate SSL into their research and make use of unlabeled data.

Published in Informatics

ISSN: 2227-9709 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/informatics
