ST-MAE: robust lane detection in continuous multi-frame driving scenes based on a deep hybrid network

Rongyun Zhang; Yufeng Du; Peicheng Shi; Lifeng Zhao; Yaming Liu; Haoran Li

doi:10.1007/s40747-022-00909-0

Complex & Intelligent Systems (Nov 2022)

ST-MAE: robust lane detection in continuous multi-frame driving scenes based on a deep hybrid network

Rongyun Zhang,
Yufeng Du,
Peicheng Shi,
Lifeng Zhao,
Yaming Liu,
Haoran Li

Affiliations

Rongyun Zhang: The School of Mechanical and Automotive Engineering, Anhui Polytechnic University
Yufeng Du: The School of Mechanical and Automotive Engineering, Anhui Polytechnic University
Peicheng Shi: Automotive New Technology Anhui Engineering and Technology Research Center, Anhui Polytechnic University
Lifeng Zhao: The School of Automotive and Traffic Engineering, Hefei University of Technology
Yaming Liu: The School of Mechanical and Automotive Engineering, Anhui Polytechnic University
Haoran Li: The School of Mechanical and Automotive Engineering, Anhui Polytechnic University

DOI: https://doi.org/10.1007/s40747-022-00909-0
Journal volume & issue: Vol. 9, no. 5
pp. 4837 – 4855

Abstract

Read online

Abstract Lane detection is one of the key techniques to realize advanced driving assistance and automatic driving. However, lane detection networks based on deep learning have significant shortcomings. The detection results are often unsatisfactory when there are shadows, degraded lane markings, and vehicle occlusion lanes. Therefore, a continuous multi-frame image sequence lane detection network is proposed. Specifically, the continuous six-frame image sequence is input into the network, in which the scene information of each frame image is extracted by an encoder composed of Swin Transformer blocks and input into the PredRNN. Continuous multi-frame of the driving scene is modeled as time-series by ST-LSTM blocks, and then, the shape changes and motion trajectory in the spatiotemporal sequence are effectively modeled. Finally, through the decoder composed of Swin Transformer blocks, the features are obtained and reconstructed to complete the detection task. Extensive experiments on two large-scale datasets demonstrate that the proposed method outperforms the competing methods in lane detection, especially in handling difficult situations. Experiments are carried out based on the TuSimple dataset. The results show: for easy scenes, the validation accuracy is 97.46%, the test accuracy is 97.37%, and the precision is 0.865. For complex scenes, the validation accuracy is 97.38%, the test accuracy is 97.29%, and the precision is 0.859. The running time is 4.4 ms. Experiments are carried out based on the CULane dataset. The results show that, for easy scenes, the validation accuracy is 97.03%, the test accuracy is 96.84%, and the precision is 0.837. For complex scenes, the validation accuracy is 96.18%, the test accuracy is 95.92%, and the precision is 0.829. The running time is 6.5 ms.

Published in Complex & Intelligent Systems

ISSN: 2199-4536 (Print); 2198-6053 (Online)
Publisher: Springer
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://www.springer.com/journal/40747

About the journal

Abstract

Keywords