Asymmetric Deep Semantic Quantization for Image Retrieval

Zhan Yang; Osolo Ian Raymond; Wuqing Sun; Jun Long

doi:10.1109/ACCESS.2019.2920712

IEEE Access (Jan 2019)

Affiliations

Zhan Yang: Network Resources Management and Trust Evaluation Key Laboratory of Hunan Province, School of Computer Science and Engineering, Central South University, Changsha, China
Osolo Ian Raymond: Network Resources Management and Trust Evaluation Key Laboratory of Hunan Province, School of Computer Science and Engineering, Central South University, Changsha, China
Wuqing Sun: Network Resources Management and Trust Evaluation Key Laboratory of Hunan Province, School of Computer Science and Engineering, Central South University, Changsha, China
Jun Long: Network Resources Management and Trust Evaluation Key Laboratory of Hunan Province, School of Computer Science and Engineering, Central South University, Changsha, China

DOI: https://doi.org/10.1109/ACCESS.2019.2920712
Journal volume & issue: Vol. 7
pp. 72684 – 72695

Abstract

Owing to its fast retrieval and storage efficiency, hashing has been widely used in nearest neighbor retrieval tasks. By using deep learning-based techniques, hashing can outperform non-learning-based hashing techniques in many applications. However, we argue that current deep learning-based hashing methods ignore some critical problems (e.g., the learned hash codes are not discriminative, because the hashing methods cannot discover rich semantic information and the training strategy has difficulty optimizing the discrete binary codes). In this paper, we propose a novel image hashing method, termed asymmetric deep semantic quantization (ADSQ). ADSQ is implemented as a three-stream framework consisting of one LabelNet and two ImgNets. The LabelNet uses three fully-connected layers to capture rich semantic information between image pairs. The two ImgNets adopt the same convolutional neural network structure but with different weights (i.e., asymmetric convolutional neural networks), and are used to generate discriminative compact hash codes. Specifically, the semantic information captured by the LabelNet guides the two ImgNets in minimizing the gap between the real-valued continuous features and the discrete binary codes. Furthermore, ADSQ can utilize the most critical semantic information to guide the feature learning process and considers the consistency of the common semantic space and the Hamming space. Experimental results on three benchmarks (CIFAR-10, NUS-WIDE, and ImageNet) demonstrate that the proposed ADSQ can outperform current state-of-the-art methods.
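
The three-stream layout described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the layer sizes, the random weights, the tanh activations, the use of a small dense stack in place of each convolutional ImgNet, and every function name below are assumptions made only for illustration. It shows one LabelNet with three fully-connected layers, two structurally identical ImgNets with separate weights, and one common way to measure the gap between continuous outputs and binary codes.

```python
import numpy as np

rng = np.random.default_rng(0)

def dense_stack(sizes):
    """Random weights for a stack of fully-connected layers (illustrative only)."""
    return [rng.standard_normal((m, n)) * 0.1 for m, n in zip(sizes[:-1], sizes[1:])]

def forward(x, weights):
    """Apply tanh after every layer so outputs stay in (-1, 1)."""
    for w in weights:
        x = np.tanh(x @ w)
    return x

def binarize(u):
    """Map real-valued outputs to {-1, +1} hash codes."""
    return np.where(u >= 0, 1, -1)

def quantization_gap(u):
    """Mean squared distance between continuous outputs and their binary codes."""
    return float(np.mean((u - binarize(u)) ** 2))

def hamming(a, b):
    """Number of differing bits between two {-1, +1} codes."""
    return int(np.sum(a != b))

# Hypothetical sizes, not taken from the paper.
N_CLASSES, FEAT_DIM, CODE_BITS = 10, 64, 16

# LabelNet: three fully-connected layers over label vectors.
label_net = dense_stack([N_CLASSES, 32, 32, CODE_BITS])

# Two ImgNets: same structure, independently initialised weights.
# A small dense stack stands in for the convolutional network here.
img_net_a = dense_stack([FEAT_DIM, 32, CODE_BITS])
img_net_b = dense_stack([FEAT_DIM, 32, CODE_BITS])

labels = np.eye(N_CLASSES)[[3, 7]]            # two one-hot label vectors
images = rng.standard_normal((2, FEAT_DIM))   # two stand-in image feature vectors

semantic = forward(labels, label_net)         # semantic-space representation
u_a = forward(images, img_net_a)              # continuous outputs, stream A
u_b = forward(images, img_net_b)              # continuous outputs, stream B
codes_a, codes_b = binarize(u_a), binarize(u_b)

print("quantization gap (stream A):", quantization_gap(u_a))
print("Hamming distance between streams, image 0:", hamming(codes_a[0], codes_b[0]))
```

In a trained system the weights would be learned so that the quantization gap shrinks and codes of semantically similar images end up close in Hamming space; the sketch only shows the data flow, not the training objective.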

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639