TLtrack: Combining Transformers and a Linear Model for Robust Multi-Object Tracking

Zuojie He; Kai Zhao; Dan Zeng

doi:10.3390/ai5030047

AI (Jun 2024)

TLtrack: Combining Transformers and a Linear Model for Robust Multi-Object Tracking

Zuojie He,
Kai Zhao,
Dan Zeng

Affiliations

Zuojie He: School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China
Kai Zhao: Department of Radiology, University of California, Los Angeles, CA 90095, USA
Dan Zeng: School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China

DOI: https://doi.org/10.3390/ai5030047
Journal volume & issue: Vol. 5, no. 3
pp. 938 – 947

Abstract

Read online

Multi-object tracking (MOT) aims at estimating locations and identities of objects in videos. Many modern multiple-object tracking systems follow the tracking-by-detection paradigm, consisting of a detector followed by a method for associating detections into tracks. Tracking by associating detections through motion-based similarity heuristics is the basic way. Motion models aim at utilizing motion information to estimate future locations, playing an important role in enhancing the performance of association. Recently, a large-scale dataset, DanceTrack, where objects have uniform appearance and diverse motion patterns, was proposed. With existing hand-crafted motion models, it is hard to achieve decent results on DanceTrack because of the lack of prior knowledge. In this work, we present a motion-based algorithm named TLtrack, which adopts a hybrid strategy to make motion estimates based on confidence scores. For high confidence score detections, TLtrack employs transformers to predict its locations. For low confidence score detections, a simple linear model that estimates locations through trajectory historical information is used. TLtrack can not only consider the historical information of the trajectory, but also analyze the latest movements. Our experimental results on the DanceTrack dataset show that our method achieves the best performance compared with other motion models.

Published in AI

ISSN: 2673-2688 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/ai

About the journal

Abstract

Keywords