PLoS ONE (Jan 2021)

Classification of masked image data.

  • Kamila Lis,
  • Mateusz Koryciński,
  • Konrad A Ciecierski

DOI
https://doi.org/10.1371/journal.pone.0254181
Journal volume & issue
Vol. 16, no. 7
p. e0254181

Abstract

Read online

Data classification is one of the most commonly used applications of machine learning. The are many developed algorithms that can work in various environments and for different data distributions that perform this task with excellence. Classification algorithms, just like other machine learning algorithms have one thing in common: in order to operate on data, they must see the data. In the present world, where concerns about privacy, GDPR (General Data Protection Regulation), business confidentiality and security are growing bigger and bigger; this requirement to work directly on the original data might become, in some situations, a burden. In this paper, an approach to the classification of images that cannot be directly accessed during training has been made. It has been shown that one can train a deep neural network to create such a representation of the original data that i) without additional information, the original data cannot be restored, and ii) that this representation-called a masked form-can still be used for classification purposes. Moreover, it has been shown that classification of the masked data can be done using both classical and neural network-based classifiers.