IEEE Access (Jan 2020)
OneShotDA: Online Multi-Object Tracker With One-Shot-Learning-Based Data Association
Abstract
Tracking multiple objects in a video sequence can be accomplished by identifying the objects appearing in the sequence and distinguishing between them. Accordingly, many recent multi-object tracking (MOT) methods have utilized re-identification and distance metric learning to distinguish between objects by computing similarity/dissimilarity scores. However, such approaches are difficult to generalize to arbitrary video sequences, because important information, such as the number of objects (classes) in a video, is not known in advance. In this study, therefore, we applied a one-shot learning framework to the MOT problem. Our algorithm tracks objects by classifying newly observed objects into existing tracks, irrespective of the number of objects appearing in a video frame. The proposed method, called OneShotDA, exploits a one-shot learning framework based on an attention mechanism. Our neural network learns to classify unseen data samples using labels from a support set. Once the network has been trained, it predicts correct labels for newly received detection results based on the set of existing tracks. To analyze the effectiveness of our method, we tested it on the MOTChallenge benchmarks (MOT16 and MOT17). The results reveal that the performance of the proposed method is comparable to that of current state-of-the-art methods. In particular, it is noteworthy that the proposed method ranked first among the online trackers on the MOT17 benchmark.
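The data-association idea summarized above — attending over a support set of existing tracks to label each new detection — can be sketched in the style of matching networks. This is a minimal illustrative example, not the paper's actual network: the function names, cosine-similarity choice, and toy embeddings are all assumptions for illustration.

```python
import numpy as np

def attention_classify(query, support_feats, support_labels):
    """Illustrative one-shot classification over a support set:
    softmax attention on cosine similarities, then pick the label
    with the largest accumulated attention mass.
    (Hypothetical sketch; not the paper's architecture.)"""
    # Normalize the query and support embeddings for cosine similarity
    q = query / np.linalg.norm(query)
    s = support_feats / np.linalg.norm(support_feats, axis=1, keepdims=True)
    sims = s @ q
    # Softmax attention over the support samples
    attn = np.exp(sims - sims.max())
    attn /= attn.sum()
    # Accumulate attention per track label and return the argmax
    scores = {}
    for a, lbl in zip(attn, support_labels):
        scores[lbl] = scores.get(lbl, 0.0) + a
    return max(scores, key=scores.get)

# Toy usage: two existing tracks (the support set), one new detection
tracks = np.array([[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]])
labels = ["track_A", "track_B", "track_A"]
detection = np.array([0.95, 0.05])
print(attention_classify(detection, tracks, labels))  # → track_A
```

Because the prediction is a soft assignment over whatever tracks are currently in the support set, the same trained model handles any number of objects per frame, which is the property the abstract highlights.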
Keywords