Encoder-Decoder Structure With the Feature Pyramid for Depth Estimation From a Single Image

Mengxia Tang; Songnan Chen; Ruifang Dong; Jiangming Kan

doi:10.1109/ACCESS.2021.3055497

IEEE Access (Jan 2021)

Encoder-Decoder Structure With the Feature Pyramid for Depth Estimation From a Single Image

Mengxia Tang,
Songnan Chen,
Ruifang Dong,
Jiangming Kan

Affiliations

Mengxia Tang: ORCiD; School of Technology, Beijing Forestry University, Beijing, China
Songnan Chen: ORCiD; School of Technology, Beijing Forestry University, Beijing, China
Ruifang Dong: ORCiD; School of Technology, Beijing Forestry University, Beijing, China
Jiangming Kan: ORCiD; School of Technology, Beijing Forestry University, Beijing, China

DOI: https://doi.org/10.1109/ACCESS.2021.3055497
Journal volume & issue: Vol. 9
pp. 22640 – 22650

Abstract

Read online

We address the problem of depth estimation from a single monocular image in the paper. Depth estimation from a single image is an ill-posed and inherently ambiguous problem. In the paper, we propose an encoder-decoder structure with the feature pyramid to predict the depth map from a single RGB image. More specifically, the feature pyramid is used to detect objects of different scales in the image. The encoder structure aims to extract the most representative information from the original image through a series of convolution operations and to reduce the resolution of the input image. We adopt Res2-50 as the encoder to extract important features. The decoder section uses a novel upsampling structure to improve the output resolution. Then, we also propose a novel loss function that adds gradient loss and surface normal loss to the depth loss, which can predict not only the global depth but also the depth of fuzzy edges and small objects. Additionally, we use Adam as our optimization function to optimize our network and speed up convergence. Our extensive experimental evaluation proves the efficiency and effectiveness of the method, which is competitive with previous methods on the Make3D dataset and outperforms state-of-the-art methods on the NYU Depth v2 dataset.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords