Multi‐modal object detection via transformer network

Wenbing Liu; Haibo Wang; Quanxue Gao; Zhaorui Zhu

doi:10.1049/ipr2.12884

IET Image Processing (Oct 2023)

Multi‐modal object detection via transformer network

Wenbing Liu,
Haibo Wang,
Quanxue Gao,
Zhaorui Zhu

Affiliations

Wenbing Liu: School of Telecommunications Engineering Xidian University Xian Shaanxi China
Haibo Wang: School of Telecommunications Engineering Xidian University Xian Shaanxi China
Quanxue Gao: School of Telecommunications Engineering Xidian University Xian Shaanxi China
Zhaorui Zhu: School of Telecommunications Engineering Xidian University Xian Shaanxi China

DOI: https://doi.org/10.1049/ipr2.12884
Journal volume & issue: Vol. 17, no. 12
pp. 3541 – 3550

Abstract

Read online

Abstract According to the fact that single‐modal data usually contain limited information, a great deal of effort has been devoted to making use of the complementary information contained in the multi‐modal data on various patterns. Thus, this paper is concerned with an object detection method that can fully utilize multi‐modal data. First, the method introduces the transformer mechanism to realize the fusion of intra‐modal and inter‐modal features of different modal data. The aim is to take advantage of the complementarity of data between modalities, which helps to improve the performance of multi‐modal object detection. Second, a contrastive loss suitable for contrastive learning is applied. This enables the authors to effectively utilize label information. Extensive experiments are conducted on multiple object detection datasets to demonstrate the effectiveness of our proposed method.

Published in IET Image Processing

ISSN: 1751-9659 (Print); 1751-9667 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Photography; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519667

About the journal

Abstract

Keywords