Self‐supervised binocular depth estimation algorithm with self‐rectification for autonomous driving

Jingyao Bao; Hongfei Yu; Yongjia Zou; Jin Lv; Wei Liu; Yang Cao

doi:10.1049/itr2.12522

IET Intelligent Transport Systems (Aug 2024)

Self‐supervised binocular depth estimation algorithm with self‐rectification for autonomous driving

Jingyao Bao,
Hongfei Yu,
Yongjia Zou,
Jin Lv,
Wei Liu,
Yang Cao

Affiliations

Jingyao Bao: School of Artificial Intelligence and Software Liaoning Petrochemical University Fushun China
Hongfei Yu: School of Artificial Intelligence and Software Liaoning Petrochemical University Fushun China
Yongjia Zou: School of Artificial Intelligence and Software Liaoning Petrochemical University Fushun China
Jin Lv: Neusoft Reach Automotive Technology (Shenyang) Co., Ltd. Shenyang China
Wei Liu: Neusoft Reach Automotive Technology (Shenyang) Co., Ltd. Shenyang China
Yang Cao: School of Artificial Intelligence and Software Liaoning Petrochemical University Fushun China

DOI: https://doi.org/10.1049/itr2.12522
Journal volume & issue: Vol. 18, no. 8
pp. 1445 – 1458

Abstract

Read online

Abstract Aiming to address the challenge where existing methods struggle to predict accurate disparities for imperfectly rectified stereo images, and that supervised training requires a considerable amount of ground truth, a self‐supervised binocular depth estimation algorithm with self‐rectification for autonomous driving is proposed. Firstly, a subnetwork dedicated to stereo rectification, aiming to estimate the homography between stereo images is developed. This homography facilitates the transformation of stereo image pairs, aligning their corresponding pixels horizontally. Secondly, a foundational self‐supervised framework primarily centred on minimizing errors in stereo image reconstruction, combined with the generative‐adversarial strategy is introduced. Finally, a vertical offset prediction module (VOPM) is incorporated into the basic framework to further enhance the resistance of the stereo matching network to pixel‐level vertical offset errors. Experimental results on the public KITTI dataset for autonomous driving demonstrate the effectiveness of this approach in improving the disparity prediction performance for imperfectly rectified stereo images. Moreover, the self‐supervised training framework exhibits superiority over state‐of‐the‐art methods.

Published in IET Intelligent Transport Systems

ISSN: 1751-956X (Print); 1751-9578 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Engineering (General). Civil engineering (General): Transportation engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519578

About the journal

Abstract

Keywords