TFITrack: Transformer Feature Integration Network for Object Tracking

Xiuhua Hu; Huan Liu; Shuang Li; Jing Zhao; Yan Hui

doi:10.1007/s44196-024-00500-0

International Journal of Computational Intelligence Systems (Apr 2024)

TFITrack: Transformer Feature Integration Network for Object Tracking

Xiuhua Hu,
Huan Liu,
Shuang Li,
Jing Zhao,
Yan Hui

Affiliations

Xiuhua Hu: School of Computer Science and Engineering, Xi’an Technological University
Huan Liu: School of Computer Science and Engineering, Xi’an Technological University
Shuang Li: School of Computer Science and Engineering, Xi’an Technological University
Jing Zhao: School of Computer Science and Engineering, Xi’an Technological University
Yan Hui: School of Computer Science and Engineering, Xi’an Technological University

DOI: https://doi.org/10.1007/s44196-024-00500-0
Journal volume & issue: Vol. 17, no. 1
pp. 1 – 20

Abstract

Read online

Abstract Due to the ignoring of rich spatio-temporal and global contextual information with convolutional neural networks in features extraction, the traditional method is prone to tracking drift or even failure in complex scenario, especially for the tiny targets in aerial photography scenario. In this work, it proposes a transformer feature integration network (TFITrack) to obtain diverse and comprehensive target feature for the robust object tracking. Based on the typical transformer architecture, it optimizes encoder and decoder structure for aggregating discriminative spatio-temporal information and global context-awareness feature. Furthermore, the encoder introduces the similarity calculation layer and dual-attention module; the aim is to deepen the similarity between features and make corrections for channel and spatial dimensions, and feature representation is improved. Finally, with the introduction of the temporal context filtering layer, unimportant feature information is ignored adaptively, obtaining a balance between the parameters number reduction and stable performance. Experimental results show that the proposed tracking algorithm exhibits excellent tracking performance on seven benchmark datasets, especially on the aerial dataset UAV123, UAV20L, and UAV123@10fps, which presents the advantages of the novel method in dealing with fast motion and external interference.

Published in International Journal of Computational Intelligence Systems

ISSN: 1875-6891 (Print); 1875-6883 (Online)
Publisher: Springer
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.springer.com/journal/44196

About the journal

Abstract

Keywords