IEEE Access (Jan 2022)
Multimodal Fusion of Deeply Inferred Point Clouds for 3D Scene Reconstruction Using Cross-Entropy ICP
Abstract
Depth estimation is a crucial step toward 3D scene understanding. Most traditional systems rely on direct sensing of this information by means of photogrammetry or stereo imaging. As scenes become more complex, these modalities are impeded by, for instance, occlusion and imperfect lighting conditions. As a consequence, reconstructed surfaces are normally left with voids due to missing data, and surface regularization is therefore often required as post-processing. With recent advances in deep learning, depth inference from a monocular image has attracted considerable interest, and many convolutional architectures have been proposed to infer depth from a single image with promising results. However, the visual cues learned and generalized by these networks may be ambiguous, resulting in inaccurate estimation. To address these issues, this paper presents an effective method for fusing point clouds extracted from depth values of the same scene, directly measured by an infrared camera and estimated from an RGB image by a modified ResNet-50. To ensure robustness and efficiency in finding correspondences between these point clouds and aligning them, an information-theoretic alignment strategy, called Cross-Entropy ICP (CEICP), is proposed. Experimental results on a public dataset demonstrate that the proposed method outperforms its counterparts while producing good-quality surface renditions of the underlying scene.
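The abstract describes two geometric stages: back-projecting per-pixel depth values into point clouds, and rigidly aligning the measured and inferred clouds. The following is a minimal sketch of those stages, not the authors' CEICP: the cross-entropy correspondence weighting is not specified here, so a standard nearest-neighbour point-to-point ICP (SVD/Kabsch update) stands in for it. The function names and the pinhole intrinsics fx, fy, cx, cy are illustrative assumptions.

# Minimal sketch of the pipeline's geometric stages (assumptions noted above).
import numpy as np
from scipy.spatial import cKDTree

def depth_to_points(depth, fx, fy, cx, cy):
    """Back-project an HxW depth map (metres) into an Nx3 point cloud
    using hypothetical pinhole intrinsics fx, fy, cx, cy."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth.ravel()
    valid = z > 0                              # drop missing-depth pixels (voids)
    x = (u.ravel() - cx) * z / fx
    y = (v.ravel() - cy) * z / fy
    return np.stack([x, y, z], axis=1)[valid]

def icp(source, target, iters=30):
    """Rigidly align `source` to `target`; plain point-to-point ICP,
    standing in for CEICP. Returns (R, t, aligned source)."""
    tree = cKDTree(target)
    src = source.copy()
    R_total, t_total = np.eye(3), np.zeros(3)
    for _ in range(iters):
        _, idx = tree.query(src)               # nearest-neighbour correspondences
        matched = target[idx]
        mu_s, mu_t = src.mean(0), matched.mean(0)
        H = (src - mu_s).T @ (matched - mu_t)  # cross-covariance of centred sets
        U, _, Vt = np.linalg.svd(H)
        R = Vt.T @ U.T
        if np.linalg.det(R) < 0:               # guard against a reflection solution
            Vt[-1] *= -1
            R = Vt.T @ U.T
        t = mu_t - R @ mu_s
        src = src @ R.T + t                    # apply the incremental transform
        R_total, t_total = R @ R_total, R @ t_total + t
    return R_total, t_total, src

In use, one cloud would come from the infrared sensor's depth map and the other from the network-inferred depth map of the same view; after alignment, the fused cloud fills voids in the sensed surface with inferred points.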
Keywords