Computational Visual Media (Oct 2021)
Learning conditional photometric stereo with high-resolution features
Abstract
Photometric stereo aims to reconstruct 3D geometry by recovering the dense surface orientation of a 3D object from multiple images taken under differing illumination. Traditional methods normally adopt simplified reflectance models to make the surface orientation computable, but the complex reflectance of real surfaces greatly limits their applicability to real-world objects. While deep neural networks have been employed to handle non-Lambertian surfaces, these methods are prone to blurring and errors, especially in high-frequency regions (such as crinkles and edges), caused by spectral bias: neural networks favor low-frequency representations and thus exhibit a bias towards smooth functions. In this paper, we therefore propose a self-learning conditional network with multi-scale features for photometric stereo, avoiding blurred reconstruction in such regions. Our contributions include: (i) a multi-scale feature fusion architecture, which simultaneously preserves high-resolution representations and deep feature extraction, and (ii) an improved gradient-motivated conditionally parameterized convolution (GM-CondConv) in our photometric stereo network, with different combinations of convolution kernels for varying surfaces. Extensive experiments on public benchmark datasets show that our calibrated photometric stereo method outperforms the state-of-the-art.
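To make the conditionally parameterized convolution idea behind GM-CondConv concrete, the following is a minimal PyTorch sketch of a generic CondConv-style layer: per-input routing weights mix several expert kernels, so different inputs effectively see different convolution kernels. The class name, expert count, and routing function are illustrative assumptions, not the authors' implementation.

```python
# Sketch of a conditionally parameterized convolution (CondConv-style) layer.
# Assumption: routing weights are computed from globally pooled features;
# the paper's GM-CondConv additionally incorporates gradient information.
import torch
import torch.nn as nn
import torch.nn.functional as F


class CondConv2d(nn.Module):
    def __init__(self, in_ch, out_ch, kernel_size=3, num_experts=4, padding=1):
        super().__init__()
        self.out_ch = out_ch
        self.padding = padding
        # One weight tensor per expert kernel: (E, out_ch, in_ch, k, k).
        self.weight = nn.Parameter(
            0.01 * torch.randn(num_experts, out_ch, in_ch, kernel_size, kernel_size))
        # Routing: global context -> per-expert mixing coefficients.
        self.routing = nn.Linear(in_ch, num_experts)

    def forward(self, x):
        b, c, h, w = x.shape
        # Per-example routing weights from globally pooled features: (b, E).
        route = torch.sigmoid(self.routing(x.mean(dim=(2, 3))))
        # Mix expert kernels per example: (b, out_ch, in_ch, k, k).
        mixed = torch.einsum('be,eoijk->boijk', route, self.weight)
        # Apply a different kernel to each example via a grouped convolution.
        x = x.reshape(1, b * c, h, w)
        mixed = mixed.reshape(b * self.out_ch, c, *mixed.shape[-2:])
        out = F.conv2d(x, mixed, padding=self.padding, groups=b)
        return out.reshape(b, self.out_ch, out.shape[-2], out.shape[-1])


# Usage: each image in the batch is convolved with its own mixture of kernels.
layer = CondConv2d(in_ch=16, out_ch=32)
y = layer(torch.randn(2, 16, 64, 64))  # -> (2, 32, 64, 64)
```

The grouped-convolution trick at the end is a standard way to apply a distinct kernel per batch element without an explicit Python loop; the mixing itself is what lets the network adapt its filters to varying surface reflectance.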
Keywords