Challenges for the Repeatability of Deep Learning Models

Saeed S. Alahmari; Dmitry B. Goldgof; Peter R. Mouton; Lawrence O. Hall

doi:10.1109/ACCESS.2020.3039833

IEEE Access (Jan 2020)

Challenges for the Repeatability of Deep Learning Models

Saeed S. Alahmari,
Dmitry B. Goldgof,
Peter R. Mouton,
Lawrence O. Hall

Affiliations

Saeed S. Alahmari: ORCiD; Department of Computer Science and Engineering, University of South Florida, Tampa, FL, USA
Dmitry B. Goldgof: ORCiD; Department of Computer Science and Engineering, University of South Florida, Tampa, FL, USA
Peter R. Mouton: ORCiD; Department of Computer Science and Engineering, University of South Florida, Tampa, FL, USA
Lawrence O. Hall: ORCiD; Department of Computer Science and Engineering, University of South Florida, Tampa, FL, USA

DOI: https://doi.org/10.1109/ACCESS.2020.3039833
Journal volume & issue: Vol. 8
pp. 211860 – 211868

Abstract

Read online

Deep learning training typically starts with a random sampling initialization approach to set the weights of trainable layers. Therefore, different and/or uncontrolled weight initialization prevents learning the same model multiple times. Consequently, such models yield different results during testing. However, even with the exact same initialization for the weights, a lack of repeatability, replicability, and reproducibility may still be observed during deep learning for many reasons such as software versions, implementation variations, and hardware differences. In this article, we study repeatability when training deep learning models for segmentation and classification tasks using U-Net and LeNet-5 architectures in two development environments Pytorch and Keras (with TensorFlow backend). We show that even with the available control of randomization in Keras and TensorFlow, there are uncontrolled randomizations. We also show repeatable results for the same deep learning architectures using the Pytorch deep learning library. Finally, we discuss variations in the implementation of the weight initialization algorithm across deep learning libraries as a source of uncontrolled error in deep learning results.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords