Maximum Relevance Minimum Redundancy Dropout with Informative Kernel Determinantal Point Process

Mohsen Saffari; Mahdi Khodayar; Mohammad Saeed Ebrahimi Saadabadi; Ana F. Sequeira; Jaime S. Cardoso

doi:10.3390/s21051846

Sensors (Mar 2021)

Maximum Relevance Minimum Redundancy Dropout with Informative Kernel Determinantal Point Process

Mohsen Saffari,
Mahdi Khodayar,
Mohammad Saeed Ebrahimi Saadabadi,
Ana F. Sequeira,
Jaime S. Cardoso

Affiliations

Mohsen Saffari: INESC TEC and Faculty of Engineering, University of Porto, 4200-465 Porto, Portugal
Mahdi Khodayar: Department of Computer Science, University of Tulsa, Tulsa, OK 74104, USA
Mohammad Saeed Ebrahimi Saadabadi: Faculty of Electrical Engineering, K. N. Toosi University of Technology, Tehran 16315-1355, Iran
Ana F. Sequeira: INESC TEC, 4200-465 Porto, Portugal
Jaime S. Cardoso: INESC TEC and Faculty of Engineering, University of Porto, 4200-465 Porto, Portugal

DOI: https://doi.org/10.3390/s21051846
Journal volume & issue: Vol. 21, no. 5
p. 1846

Abstract

Read online

In recent years, deep neural networks have shown significant progress in computer vision due to their large generalization capacity; however, the overfitting problem ubiquitously threatens the learning process of these highly nonlinear architectures. Dropout is a recent solution to mitigate overfitting that has witnessed significant success in various classification applications. Recently, many efforts have been made to improve the Standard dropout using an unsupervised merit-based semantic selection of neurons in the latent space. However, these studies do not consider the task-relevant information quality and quantity and the diversity of the latent kernels. To solve the challenge of dropping less informative neurons in deep learning, we propose an efficient end-to-end dropout algorithm that selects the most informative neurons with the highest correlation with the target output considering the sparsity in its selection procedure. First, to promote activation diversity, we devise an approach to select the most diverse set of neurons by making use of determinantal point process (DPP) sampling. Furthermore, to incorporate task specificity into deep latent features, a mutual information (MI)-based merit function is developed. Leveraging the proposed MI with DPP sampling, we introduce the novel DPPMI dropout that adaptively adjusts the retention rate of neurons based on their contribution to the neural network task. Empirical studies on real-world classification benchmarks including, MNIST, SVHN, CIFAR10, CIFAR100, demonstrate the superiority of our proposed method over recent state-of-the-art dropout algorithms in the literature.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords