TR-Net: A Transformer-Based Neural Network for Point Cloud Processing

Luyao Liu; Enqing Chen; Yingqiang Ding

doi:10.3390/machines10070517

Machines (Jun 2022)

TR-Net: A Transformer-Based Neural Network for Point Cloud Processing

Luyao Liu,
Enqing Chen,
Yingqiang Ding

Affiliations

Luyao Liu: School of Information Engineering, Zhengzhou University, No. 100 Science Avenue, Zhengzhou 450001, China
Enqing Chen: School of Information Engineering, Zhengzhou University, No. 100 Science Avenue, Zhengzhou 450001, China
Yingqiang Ding: School of Information Engineering, Zhengzhou University, No. 100 Science Avenue, Zhengzhou 450001, China

DOI: https://doi.org/10.3390/machines10070517
Journal volume & issue: Vol. 10, no. 7
p. 517

Abstract

Read online

Point cloud is a versatile geometric representation that could be applied in computer vision tasks. On account of the disorder of point cloud, it is challenging to design a deep neural network used in point cloud analysis. Furthermore, most existing frameworks for point cloud processing either hardly consider the local neighboring information or ignore context-aware and spatially-aware features. To deal with the above problems, we propose a novel point cloud processing architecture named TR-Net, which is based on transformer. This architecture reformulates the point cloud processing task as a set-to-set translation problem. TR-Net directly operates on raw point clouds without any data transformation or annotation, which reduces the consumption of computing resources and memory usage. Firstly, a neighborhood embedding backbone is designed to effectively extract the local neighboring information from point cloud. Then, an attention-based sub-network is constructed to better learn a semantically abundant and discriminatory representation from embedded features. Finally, effective global features are yielded through feeding the features extracted by attention-based sub-network into a residual backbone. For different downstream tasks, we build different decoders. Extensive experiments on the public datasets illustrate that our approach outperforms other state-of-the-art methods. For example, our TR-Net performs 93.1% overall accuracy on the ModelNet40 dataset and the TR-Net archives a mIou of 85.3% on the ShapeNet dataset for part segmentation.

Published in Machines

ISSN: 2075-1702 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Mechanical engineering and machinery
Website: http://www.mdpi.com/journal/machines

About the journal

Abstract

Keywords