Hybrid Attention Network for Language-Based Person Search

Yang Li; Huahu Xu; Junsheng Xiao

doi:10.3390/s20185279

Sensors (Sep 2020)

Hybrid Attention Network for Language-Based Person Search

Yang Li,
Huahu Xu,
Junsheng Xiao

Affiliations

Yang Li: School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
Huahu Xu: School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
Junsheng Xiao: School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China

DOI: https://doi.org/10.3390/s20185279
Journal volume & issue: Vol. 20, no. 18
p. 5279

Abstract

Read online

Language-based person search retrieves images of a target person using natural language description and is a challenging fine-grained cross-modal retrieval task. A novel hybrid attention network is proposed for the task. The network includes the following three aspects: First, a cubic attention mechanism for person image, which combines cross-layer spatial attention and channel attention. It can fully excavate both important midlevel details and key high-level semantics to obtain better discriminative fine-grained feature representation of a person image. Second, a text attention network for language description, which is based on bidirectional LSTM (BiLSTM) and self-attention mechanism. It can better learn the bidirectional semantic dependency and capture the key words of sentences, so as to extract the context information and key semantic features of the language description more effectively and accurately. Third, a cross-modal attention mechanism and a joint loss function for cross-modal learning, which can pay more attention to the relevant parts between text and image features. It can better exploit both the cross-modal and intra-modal correlation and can better solve the problem of cross-modal heterogeneity. Extensive experiments have been conducted on the CUHK-PEDES dataset. Our approach obtains higher performance than state-of-the-art approaches, demonstrating the advantage of the approach we propose.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords