IEEE Access (Jan 2020)

Two-Stream Network Based on Visual Saliency Sharing for 3D Model Recognition

  • Weizhi Nie,
  • Lu Qu,
  • Minjie Ren,
  • Qi Liang,
  • Yuting Su,
  • Yangyang Li,
  • Hao Jin

DOI
https://doi.org/10.1109/ACCESS.2019.2963511
Journal volume & issue
Vol. 8
pp. 5979 – 5989

Abstract


Shape representation for 3D models is an important topic in computer vision, multimedia analysis, and computer graphics. Recent multiview-based methods demonstrate promising performance for 3D model recognition and retrieval. However, most multiview-based methods focus on the visual information in the captured views and ignore the correlation information among these views, which means the similarity and differentiation of multiple views are lost. To address this issue, we propose a novel two-stream network architecture for 3D model recognition and retrieval. The proposed network includes two sub-networks: a multi-view convolutional neural network (MVCNN) that extracts visual information from the captured views, and a visual saliency model that assigns weights to the views based on their similarity and differentiation. Specifically, the view weights produced by the visual saliency model effectively guide the fusion of visual information in the MVCNN. This design allows the MVCNN to preserve both the visual information and the correlation information of the views during learning. Finally, we employ an early-fusion method to combine the feature vectors from the MVCNN and the visual saliency model, generating the shape descriptor for 3D model recognition and retrieval. Experimental results on two public datasets, ModelNet40 and ShapeNetCore55, demonstrate that the correlation information of multiple views is crucial for view-based 3D model recognition methods and that the proposed method achieves state-of-the-art performance on both 3D object classification and retrieval.
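The fusion step the abstract describes can be sketched in a few lines. The snippet below is a minimal NumPy illustration, not the authors' implementation: it assumes the MVCNN stream yields one feature vector per view, the saliency stream yields one importance score per view plus its own feature vector, and that the per-view scores are normalized with a softmax before saliency-guided pooling. All names (`fuse_descriptors`, `saliency_feat`, etc.) are hypothetical.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D score vector."""
    e = np.exp(x - x.max())
    return e / e.sum()

def fuse_descriptors(view_feats, saliency_scores, saliency_feat):
    """Sketch of saliency-guided pooling plus early fusion (assumed design).

    view_feats:      (V, D) per-view features from the MVCNN stream
    saliency_scores: (V,)   per-view importance from the saliency stream
    saliency_feat:   (K,)   feature vector from the saliency stream
    """
    w = softmax(saliency_scores)      # view weights, sum to 1
    mvcnn_desc = w @ view_feats       # saliency-weighted pooling over views
    # Early fusion: concatenate the two streams' descriptors
    return np.concatenate([mvcnn_desc, saliency_feat])
```

Under this reading, views the saliency stream deems more discriminative contribute more to the pooled MVCNN descriptor, so the correlation information among views is retained rather than discarded by uniform max/mean pooling.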

Keywords