Sensors (Sep 2022)

Absolute Camera Pose Regression Using an RGB-D Dual-Stream Network and Handcrafted Base Poses

  • Peng-Yuan Kao,
  • Rong-Rong Zhang,
  • Timothy Chen,
  • Yi-Ping Hung

DOI
https://doi.org/10.3390/s22186971
Journal volume & issue
Vol. 22, no. 18
p. 6971

Abstract

Read online

Absolute pose regression (APR) for camera localization is a single-shot approach that encodes the information of a 3D scene in an end-to-end neural network. The camera pose result of APR methods can be observed as the linear combination of the base poses. Previous APR methods’ base poses are learned from training data. However, the training data can limit the performance of the methods, which cannot be generalized to cover the entire scene. To solve this issue, we use handcrafted base poses instead of learning-based base poses, which prevents overfitting the camera poses of the training data. Moreover, we use a dual-stream network architecture to process color and depth images separately to get more accurate localization. On the 7 Scenes dataset, the proposed method is among the best in median rotation error, and in median translation error, it outperforms previous APR methods. On a more difficult dataset—Oxford RobotCar dataset, the proposed method achieves notable improvements in median translation and rotation errors compared to the state-of-the-art APR methods.

Keywords