An efficient and accurate multi-level cascaded recurrent network for stereo matching

Ziyu Zhong; Xiuze Yang; Xiubian Pan; Wei Guan; Ke Liang; Jing Li; Xiaolan Liao; Shuo Wang

doi:10.1038/s41598-024-57321-6

Scientific Reports (Apr 2024)

Affiliations

Ziyu Zhong: School of Mechanical Engineering, Guangxi University
Xiuze Yang: School of Mechanical Engineering, Guangxi University
Xiubian Pan: School of Mechanical Engineering, Guangxi University
Wei Guan: School of Mechanical Engineering, Guangxi University
Ke Liang: School of Mechanical Engineering, Guangxi University
Jing Li: School of Mechanical Engineering, Guangxi University
Xiaolan Liao: School of Mechanical Engineering, Guangxi University
Shuo Wang: School of Mechanical Engineering, Guangxi University

DOI: https://doi.org/10.1038/s41598-024-57321-6
Journal volume & issue: Vol. 14, no. 1, pp. 1–14

Abstract

With the advent of Transformer-based convolutional neural networks, stereo matching algorithms have achieved state-of-the-art accuracy in disparity estimation. Nevertheless, these methods require long inference times, which is the main factor limiting their use in many vision tasks and in robotics. To address the trade-off between accuracy and efficiency, this paper proposes an efficient and accurate multi-level cascaded recurrent network, LMCR-Stereo. To recover the detailed information of stereo images more accurately, we first design a multi-level network that updates the difference values in a coarse-to-fine, recurrent, iterative manner. Then, we propose a new pair of slow-fast multi-stage superposition inference structures to accommodate the differences between scene datasets. In addition, to improve disparity estimation accuracy while speeding up inference, we introduce a pair of adaptive and lightweight group correlation layers that reduce the impact of erroneous rectification and significantly improve model inference speed. The experimental results show that the proposed approach achieves competitive disparity estimation accuracy with faster inference than current state-of-the-art methods. Notably, the model inference speed of the proposed approach is improved by 46.0% and 50.4% on the SceneFlow test set and the Middlebury benchmark, respectively.
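The abstract does not spell out how the group correlation layers are built, so the following is only a minimal sketch of generic group-wise correlation on a single scanline, not the LMCR-Stereo layer itself. The function names `group_correlation` and `best_disparity`, the list-based feature layout, and the parameters `num_groups` and `max_disp` are assumptions made for illustration.

```python
def group_correlation(left, right, num_groups, max_disp):
    """Generic group-wise correlation along one scanline (illustrative only).

    left, right: feature maps as lists of channels, each a list of W floats.
    Returns cost[g][d][x]: the mean product of the group-g channels between
    left pixel x and right pixel x - d (0.0 where x - d falls off the image).
    """
    channels = len(left)
    if channels % num_groups != 0:
        raise ValueError("channel count must be divisible by num_groups")
    width = len(left[0])
    per_group = channels // num_groups
    cost = [[[0.0] * width for _ in range(max_disp)] for _ in range(num_groups)]
    for g in range(num_groups):
        group = range(g * per_group, (g + 1) * per_group)
        for d in range(max_disp):
            # Pixels with x < d have no counterpart in the right image.
            for x in range(d, width):
                total = sum(left[c][x] * right[c][x - d] for c in group)
                cost[g][d][x] = total / per_group
    return cost


def best_disparity(cost, x):
    """Pick the disparity whose correlation, summed over groups, is largest."""
    max_disp = len(cost[0])
    scores = [sum(cost[g][d][x] for g in range(len(cost))) for d in range(max_disp)]
    return max(range(max_disp), key=scores.__getitem__)
```

Splitting the channels into groups keeps one correlation score per group instead of collapsing all channels into a single value, which is the usual motivation for this kind of layer. A learned network would replace the arg-max in `best_disparity` with further processing of the cost volume.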

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
