IEEE Access (Jan 2019)

Deep Multi-Level Semantic Hashing for Cross-Modal Retrieval

  • Zhenyan Ji
  • Weina Yao
  • Wei Wei
  • Houbing Song
  • Huaiyu Pi

DOI
https://doi.org/10.1109/ACCESS.2019.2899536
Journal volume & issue
Vol. 7
pp. 23667–23674

Abstract

With the rapid growth of multimodal data, cross-modal search has attracted wide research interest. Owing to their efficiency in storage and computation, hashing-based methods are widely used for large-scale cross-modal retrieval. Most existing hashing methods rely on binary supervision, which reduces the complex relationships among multi-label data to a simple similar/dissimilar distinction. Few methods, however, have exploited the rich semantic information implicit in multi-label data to improve retrieval accuracy. In this paper, a multi-level semantic supervision generating approach is proposed that explores label relevance, and a deep hashing framework is designed for multi-label image-text cross-modal retrieval tasks. The framework simultaneously captures the binary similarity and the complex multi-level semantic structure of data across modalities. Moreover, the effects of three convolutional neural networks, CNN-F, VGG-16, and ResNet-50, on the retrieval results are compared. Experimental results on an open-source cross-modal dataset show that our approach outperforms several state-of-the-art hashing methods, and that retrieval with the CNN-F network outperforms VGG-16 and ResNet-50.
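To make concrete the distinction drawn above between binary supervision and multi-level semantic supervision, the short Python sketch below contrasts the two on a toy multi-label pair. The graded Jaccard affinity is an illustrative assumption; the paper's exact formulation of label relevance is not given in this abstract.

    import numpy as np

    def binary_similarity(labels_a, labels_b):
        # Conventional binary supervision: 1 if the samples share any
        # label, else 0 -- the multi-label structure is collapsed.
        return float(np.any(np.logical_and(labels_a, labels_b)))

    def multilevel_similarity(labels_a, labels_b):
        # Hypothetical multi-level supervision: graded affinity from
        # label overlap (Jaccard), preserving how many labels are shared.
        inter = np.logical_and(labels_a, labels_b).sum()
        union = np.logical_or(labels_a, labels_b).sum()
        return inter / union if union else 0.0

    # Two samples annotated over 5 possible labels.
    a = np.array([1, 1, 0, 1, 0])   # labels {0, 1, 3}
    b = np.array([1, 0, 0, 1, 1])   # labels {0, 3, 4}

    print(binary_similarity(a, b))      # 1.0 -> just "similar"
    print(multilevel_similarity(a, b))  # 0.5 -> shares 2 of 4 labels

A hashing loss supervised by such graded affinities can then place a pair sharing three labels closer in Hamming space than a pair sharing one, which is exactly the structure a 0/1 signal discards.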

Keywords