Deep Neural Networks Classification via Binary Error-Detecting Output Codes

Martin Klimo; Peter Lukáč; Peter Tarábek

doi:10.3390/app11083563

Applied Sciences (Apr 2021)

Deep Neural Networks Classification via Binary Error-Detecting Output Codes

Martin Klimo,
Peter Lukáč,
Peter Tarábek

Affiliations

Martin Klimo: Faculty of Management Sciences and Informatics, University of Žilina, 01026 Žilina, Slovakia
Peter Lukáč: Faculty of Management Sciences and Informatics, University of Žilina, 01026 Žilina, Slovakia
Peter Tarábek: Faculty of Management Sciences and Informatics, University of Žilina, 01026 Žilina, Slovakia

DOI: https://doi.org/10.3390/app11083563
Journal volume & issue: Vol. 11, no. 8
p. 3563

Abstract

Read online

One-hot encoding is the prevalent method used in neural networks to represent multi-class categorical data. Its success stems from its ease of use and interpretability as a probability distribution when accompanied by a softmax activation function. However, one-hot encoding leads to very high dimensional vector representations when the categorical data’s cardinality is high. The Hamming distance in one-hot encoding is equal to two from the coding theory perspective, which does not allow detection or error-correcting capabilities. Binary coding provides more possibilities for encoding categorical data into the output codes, which mitigates the limitations of the one-hot encoding mentioned above. We propose a novel method based on Zadeh fuzzy logic to train binary output codes holistically. We study linear block codes for their possibility of separating class information from the checksum part of the codeword, showing their ability not only to detect recognition errors by calculating non-zero syndrome, but also to evaluate the truth-value of the decision. Experimental results show that the proposed approach achieves similar results as one-hot encoding with a softmax function in terms of accuracy, reliability, and out-of-distribution performance. It suggests a good foundation for future applications, mainly classification tasks with a high number of classes.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords