Радіоелектронні і комп'ютерні системи (Mar 2019)
METHOD OF UNSUPERVISED LEARNING OF HIERARCHICAL EXTRACTOR OF VISUAL FEATURES BASED ON MODIFICATION OF NEURAL GAS
Abstract
Modern technologies for the intelligent analysis of visual information are analyzed with respect to real-time unsupervised learning aimed at adapting to unknown observation conditions. It is proposed to use 10 layers of the well-known VGG-16 neural network as a model of a hierarchical extractor of visual features that can be used in transfer learning tasks. The use of neural gas principles to increase the convergence rate of the unsupervised learning algorithm for the visual feature extractor under a limited amount of training data is considered. The modification of neural gas, aimed at sparse coding of input observations, is based on the optimized orthogonal matching pursuit algorithm, which is used to increase the informativeness of the feature set when the sample size is limited. The training dataset is generated by selecting images from the popular ImageNet image database and extracting patches from the selected images or from the feature maps at a given layer. The method of so-called information-extreme machine learning of decision rules is proposed for assessing the efficiency of the proposed feature extractor. Information-extreme learning is based on binary coding of the feature representation of observations and on the construction of radial-basis decision rules in binary Hamming space. The implementation of the algorithm relies on computationally simple operations such as comparison with a threshold and bitwise XOR. The geometric parameters of the partition of the feature space into separate classes are optimized in the binary space; therefore, the optimization can be implemented as a sequential direct search with a given step, since the number of such steps is relatively small. The parameters of the observation encoding rules are optimized with a population-based particle swarm algorithm, which searches for the global maximum of the logarithmic Kullback information criterion within the admissible domain of its function.
In this case, a normalized modification of the information criterion, expressed as a function of the errors of the first and second kind, is used. The effectiveness of training decision rules with the unsupervised feature extractor is compared with that obtained using a feature extractor trained in a supervised manner by stochastic gradient descent. The results of physical modeling show that unsupervised learning of the extractor provides decision-rule accuracy of up to 96.4%, which is lower than the 98.7% accuracy obtained with supervised learning.
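The binary coding and radial-basis decision rules in Hamming space mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`encode`, `class_center`, `hamming`, `classify`), the single per-feature threshold, the majority-vote class center, and the nearest-container tie-breaking are all assumptions made for illustration only.

```python
import numpy as np

def encode(features, thresholds):
    """Binary-encode real-valued features: bit is 1 where feature > threshold.
    A single threshold per feature is an assumption of this sketch."""
    return (np.asarray(features) > np.asarray(thresholds)).astype(np.uint8)

def class_center(codes):
    """Hypothetical class reference vector: per-bit majority vote over the
    binary codes of one class (rows = observations)."""
    return (np.asarray(codes).mean(axis=0) >= 0.5).astype(np.uint8)

def hamming(a, b):
    """Hamming distance computed with a bitwise XOR."""
    return int(np.count_nonzero(np.bitwise_xor(a, b)))

def classify(code, centers, radii):
    """Radial-basis rule: return the index of the nearest class whose
    container (center, radius) holds the code, or -1 if none does."""
    best_class, best_dist = -1, None
    for k, (center, radius) in enumerate(zip(centers, radii)):
        d = hamming(code, center)
        if d <= radius and (best_dist is None or d < best_dist):
            best_class, best_dist = k, d
    return best_class

# Illustrative data only: two classes in a 4-bit Hamming space.
centers = [np.array([1, 1, 0, 0], dtype=np.uint8),
           np.array([0, 0, 1, 1], dtype=np.uint8)]
radii = [1, 1]
label = classify(encode([0.9, 0.8, 0.1, 0.7], 0.5), centers, radii)
```

In such a scheme the container radii take integer values bounded by the code length, which is consistent with the abstract's remark that they can be tuned by a direct search with a fixed step.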
Keywords