Deep Learning for Real-Time 3D Multi-Object Detection, Localisation, and Tracking: Application to Smart Mobility

Antoine Mauri; Redouane Khemmar; Benoit Decoux; Nicolas Ragot; Romain Rossi; Rim Trabelsi; Rémi Boutteau; Jean-Yves Ertaud; Xavier Savatier

doi:10.3390/s20020532

Sensors (Jan 2020)

Deep Learning for Real-Time 3D Multi-Object Detection, Localisation, and Tracking: Application to Smart Mobility

Antoine Mauri,
Redouane Khemmar,
Benoit Decoux,
Nicolas Ragot,
Romain Rossi,
Rim Trabelsi,
Rémi Boutteau,
Jean-Yves Ertaud,
Xavier Savatier

Affiliations

Antoine Mauri: Normandie University, UNIROUEN, ESIGELEC, IRSEEM, 76000 Rouen, France
Redouane Khemmar: Normandie University, UNIROUEN, ESIGELEC, IRSEEM, 76000 Rouen, France
Benoit Decoux: Normandie University, UNIROUEN, ESIGELEC, IRSEEM, 76000 Rouen, France
Nicolas Ragot: Normandie University, UNIROUEN, ESIGELEC, IRSEEM, 76000 Rouen, France
Romain Rossi: Normandie University, UNIROUEN, ESIGELEC, IRSEEM, 76000 Rouen, France
Rim Trabelsi: Normandie University, UNIROUEN, ESIGELEC, IRSEEM, 76000 Rouen, France
Rémi Boutteau: Normandie University, UNIROUEN, ESIGELEC, IRSEEM, 76000 Rouen, France
Jean-Yves Ertaud: Normandie University, UNIROUEN, ESIGELEC, IRSEEM, 76000 Rouen, France
Xavier Savatier: Normandie University, UNIROUEN, ESIGELEC, IRSEEM, 76000 Rouen, France

DOI: https://doi.org/10.3390/s20020532
Journal volume & issue: Vol. 20, no. 2
p. 532

Abstract

Read online

In core computer vision tasks, we have witnessed significant advances in object detection, localisation and tracking. However, there are currently no methods to detect, localize and track objects in road environments, and taking into account real-time constraints. In this paper, our objective is to develop a deep learning multi object detection and tracking technique applied to road smart mobility. Firstly, we propose an effective detector-based on YOLOv3 which we adapt to our context. Subsequently, to localize successfully the detected objects, we put forward an adaptive method aiming to extract 3D information, i.e., depth maps. To do so, a comparative study is carried out taking into account two approaches: Monodepth2 for monocular vision and MADNEt for stereoscopic vision. These approaches are then evaluated over datasets containing depth information in order to discern the best solution that performs better in real-time conditions. Object tracking is necessary in order to mitigate the risks of collisions. Unlike traditional tracking approaches which require target initialization beforehand, our approach consists of using information from object detection and distance estimation to initialize targets and to track them later. Expressly, we propose here to improve SORT approach for 3D object tracking. We introduce an extended Kalman filter to better estimate the position of objects. Extensive experiments carried out on KITTI dataset prove that our proposal outperforms state-of-the-art approches.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords