Hourglass MobileNetV3_large Stereo Matching Network

Lijie Yang; Mengjun Zhang; Jiehui Liu; Jinxi Guo; Jianshen Li

doi:10.1109/ACCESS.2024.3492338

IEEE Access (Jan 2024)

Hourglass MobileNetV3_large Stereo Matching Network

Lijie Yang,
Mengjun Zhang,
Jiehui Liu,
Jinxi Guo,
Jianshen Li

Affiliations

Lijie Yang: ORCiD; School of Mechanical and Equipment Engineering, Hebei University of Engineering, Handan, China
Mengjun Zhang: ORCiD; School of Mechanical and Equipment Engineering, Hebei University of Engineering, Handan, China
Jiehui Liu: School of Mechanical and Equipment Engineering, Hebei University of Engineering, Handan, China
Jinxi Guo: School of Mechanical and Equipment Engineering, Hebei University of Engineering, Handan, China
Jianshen Li: School of Mechanical and Equipment Engineering, Hebei University of Engineering, Handan, China

DOI: https://doi.org/10.1109/ACCESS.2024.3492338
Journal volume & issue: Vol. 12
pp. 168779 – 168790

Abstract

Read online

Stereo matching using rich contextual information can reduce false matches in pathological regions. Inadequate extraction of contextual feature information as well as high complexity matching models are the main reasons for the low accuracy and poor generalization of stereo matching models. To solve the above problems, we propose an Hourglass MobileNetV3_large stereo matching network. Firstly, the model is based on the lightweight MobileNetV3_large, which recovers and fuses feature information for high-resolution feature maps by combining feature maps at different scales. Secondly, the hybrid attention mechanism is embedded into the model to complete the further enhancement of the capture of global contextual information. The matching cost volume of lightweight group-wise correlation is utilized to complete the measurement of the difference between the corresponding pixels of the left and right images. Finally, the optimization of the parallax map is done by a modified hourglass-type regularization module. The model is tested and evaluated on SceneFlow, KITTI2012, KITTI2015, and Middlebury 2014. Experimental results show that the network model proposed in this paper even outperforms PSMNet algorithm and GwcNet algorithm on the KITTI2015 dataset in terms of All-D1-all metrics by 18.5% and 10.4%, respectively. Test results at Middlebury 2014 also show that the algorithm in this paper has good generalization performance. In addition, the network framework can also be applied to a variety of neural network models and good experimental results have been obtained for all of them.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords