A Multi-Task Network Based on Dual-Neck Structure for Autonomous Driving Perception

Guopeng Tan; Chao Wang; Zhihua Li; Yuanbiao Zhang; Ruikai Li

doi:10.3390/s24051547

Sensors (Feb 2024)

Affiliations

Guopeng Tan: School of Information & Electrical Engineering, Hebei University of Engineering, Handan 056038, China
Chao Wang: School of Information & Electrical Engineering, Hebei University of Engineering, Handan 056038, China
Zhihua Li: School of Information & Electrical Engineering, Hebei University of Engineering, Handan 056038, China
Yuanbiao Zhang: School of Information & Electrical Engineering, Hebei University of Engineering, Handan 056038, China
Ruikai Li: School of Information & Electrical Engineering, Hebei University of Engineering, Handan 056038, China

DOI: https://doi.org/10.3390/s24051547
Journal volume & issue: Vol. 24, no. 5, p. 1547

Abstract

A vision-based autonomous driving perception system must perform several tasks, including vehicle detection, drivable area segmentation, and lane line segmentation. Because computational resources are limited, multi-task learning has become the preferred approach for building such systems. In this article, we introduce an efficient end-to-end multi-task learning model that performs well on all three tasks. We first build a reliable feature extraction network around a new feature extraction module called C2SPD. To account for the differences among tasks, we then propose a dual-neck architecture. Finally, we present an optimized decoder design for each task. Our model performs strongly on the challenging BDD100K dataset, attaining remarkable accuracy (Acc) in vehicle detection and superior mean intersection over union (mIoU) in drivable area segmentation. In addition, this is the first work that can process these three visual perception tasks simultaneously in real time on the Atlas 200I A2 embedded device while maintaining excellent accuracy.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors
