Multi-Scale Monocular Depth Estimation Based on Global Understanding

Jiejie Xiao; Lihong Li; Xu Su; Guopeng Tan

doi:10.1109/ACCESS.2024.3382572

IEEE Access (Jan 2024)

Multi-Scale Monocular Depth Estimation Based on Global Understanding

Jiejie Xiao,
Lihong Li,
Xu Su,
Guopeng Tan

Affiliations

Jiejie Xiao: ORCiD; School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China
Lihong Li: ORCiD; Hebei Key Laboratory of Security and Protection Information Sensing and Processing, Hebei University of Engineering, Handan, China
Xu Su: ORCiD; School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China
Guopeng Tan: ORCiD; School of Information and Electrical Engineering, Hebei University of Engineering, Handan, China

DOI: https://doi.org/10.1109/ACCESS.2024.3382572
Journal volume & issue: Vol. 12
pp. 46930 – 46939

Abstract

Read online

With the advancement of Convolutional Neural Networks, numerous convolutional neural network-based methods have been proposed for depth estimation and have achieved significant achievements. However, the repetitive convolutional layers and spatial pooling layers in these networks often lead to a reduction in spatial resolution and loss of local information, such as edge contours. To address this issue, this study presents a multi-scale monocular depth estimation model. Specifically, a Global Understanding Module was introduced on top of a generic encoder to increase the receptive field and capture contextual information. Additionally, the decoding process incorporates a Difference Module and a Multi-scale Cascade Module to guide the decoding information and refine edge contour details. Finally, extensive experiments were conducted using the KITTI and NYUv2 datasets. For the KITTI dataset, the Absolute Relative Error (Abs. Rel) was 0.057, and the Root Mean Squared Error (RMSE) was 2.415. On the NYUv2 dataset, Abs.Rel was 0.104, and RMSE was 0.380. These results indicate that the model performs well in accurately estimating depth information.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords