Self-Supervised Learning Methods for Label-Efficient Dental Caries Classification

Aiham Taleb; Csaba Rohrer; Benjamin Bergner; Guilherme De Leon; Jonas Almeida Rodrigues; Falk Schwendicke; Christoph Lippert; Joachim Krois

doi:10.3390/diagnostics12051237

Diagnostics (May 2022)

Self-Supervised Learning Methods for Label-Efficient Dental Caries Classification

Aiham Taleb,
Csaba Rohrer,
Benjamin Bergner,
Guilherme De Leon,
Jonas Almeida Rodrigues,
Falk Schwendicke,
Christoph Lippert,
Joachim Krois

Affiliations

Aiham Taleb: Digital Health & Machine Learning, Hasso Plattner Institute, University of Potsdam, 14469 Potsdam, Germany
Csaba Rohrer: Department of Oral Diagnostics, Digital Health and Health Services Research, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany
Benjamin Bergner: Digital Health & Machine Learning, Hasso Plattner Institute, University of Potsdam, 14469 Potsdam, Germany
Guilherme De Leon: Contraste Radiologia Odontológica, Blumenau 89010-050, SC, Brazil
Jonas Almeida Rodrigues: Department of Surgery and Orthopedics, School of Dentistry, Universidade Federal do Rio Grande do Sul—UFRGS, Porto Alegre 90010-460, RS, Brazil
Falk Schwendicke: Department of Oral Diagnostics, Digital Health and Health Services Research, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany
Christoph Lippert: Digital Health & Machine Learning, Hasso Plattner Institute, University of Potsdam, 14469 Potsdam, Germany
Joachim Krois: Department of Oral Diagnostics, Digital Health and Health Services Research, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany

DOI: https://doi.org/10.3390/diagnostics12051237
Journal volume & issue: Vol. 12, no. 5
p. 1237

Abstract

Read online

High annotation costs are a substantial bottleneck in applying deep learning architectures to clinically relevant use cases, substantiating the need for algorithms to learn from unlabeled data. In this work, we propose employing self-supervised methods. To that end, we trained with three self-supervised algorithms on a large corpus of unlabeled dental images, which contained 38K bitewing radiographs (BWRs). We then applied the learned neural network representations on tooth-level dental caries classification, for which we utilized labels extracted from electronic health records (EHRs). Finally, a holdout test-set was established, which consisted of 343 BWRs and was annotated by three dental professionals and approved by a senior dentist. This test-set was used to evaluate the fine-tuned caries classification models. Our experimental results demonstrate the obtained gains by pretraining models using self-supervised algorithms. These include improved caries classification performance (6 p.p. increase in sensitivity) and, most importantly, improved label-efficiency. In other words, the resulting models can be fine-tuned using few labels (annotations). Our results show that using as few as 18 annotations can produce ≥45% sensitivity, which is comparable to human-level diagnostic performance. This study shows that self-supervision can provide gains in medical image analysis, particularly when obtaining labels is costly and expensive.

Published in Diagnostics

ISSN: 2075-4418 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Medicine (General)
Website: http://www.mdpi.com/journal/diagnostics

About the journal

Abstract

Keywords