TNPC: Transformer-based network for cloud classification☆

被引:13
作者
Zhou, Wei [1 ]
Zhao, Yiheng [1 ]
Xiao, Yi [1 ]
Min, Xuanlin [1 ]
Yi, Jun [1 ]
机构
[1] Chongqing Univ Sci & Technol, Coll Intelligent Technol & Engn, Chongqing 401331, Peoples R China
关键词
Point cloud classification; Transformer; Deep learning;
D O I
10.1016/j.eswa.2023.122438
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Point cloud classification has emerged as a vital research area in several emerging applications, including robotics and autonomous driving. However, discriminative feature learning has long been a challenging issue owing to the irregularity and disorder of point clouds. Recently, although Transformer-based methods can achieve high accuracy in point cloud learning, plenty of Transformer layers bring huge computation and memory consumption. This paper presents a novel hierarchical local-global framework based on Transformer network for point clouds, named TNPC. TNPC contains two serial Stages implementing downsampling operation, and each Stage is composed of a local feature extracting (LFE) block and a global feature extracting (GFE) block, which can reduce computation and memory consumption obviously. LFE block consists of two parallel branches, namely Transformer branch and shared multilayer perceptron (SMLP) branch, which are designed to learn relevant feature of any two points and local high-dimensional semantic features of each point between sampling centroid and its neighborhoods, respectively. The proposed two parallel branches not only improve the feature extraction effect, but also reduce the computation and memory consumption. GFE block consists of a center point contact (CPC) module and a global point cloud transformer layer (PCTL) module, which can improve the effect of global features extracting without adding the number of parameters and computation. The performance of our method is validated experimentally on ModelNet40 and ScanObjectNN datasets. Our method improves the mean accuracy on each category (mAcc) to 91.6% and 79.8% on the ModelNet40 dataset and ScanObjectNN dataset, respectively. In terms of efficiency, our method leads to a significant reduction, with only 4.73MB parameters and only 1.91GB floating-point operations (FLOPs). Experimental results demonstrate that the proposed method achieves state-of-the-art performance on classification accuracy and efficiency.
引用
收藏
页数:10
相关论文
共 34 条
[1]   PDConv: Rigid transformation invariant convolution for 3D point clouds [J].
Bello, Saifullahi Aminu ;
Wang, Cheng ;
Sun, Xiaotian ;
Deng, Haowen ;
Adam, Jibril Muhammad ;
Bhatti, Muhammad Kamran Afzal ;
Wambugu, Naftaly Muriuki .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 210
[2]   Lightweight Attention Module for Deep Learning on Classification and Segmentation of 3-D Point Clouds [J].
Cui, Yunhao ;
An, Yi ;
Sun, Wei ;
Hu, Huosheng ;
Song, Xueguan .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
[3]   GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition [J].
Feng, Yifan ;
Zhang, Zizhao ;
Zhao, Xibin ;
Ji, Rongrong ;
Gao, Yue .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :264-272
[4]   LFT-Net: Local Feature Transformer Network for Point Clouds Analysis [J].
Gao, Yongbin ;
Liu, Xuebing ;
Li, Jun ;
Fang, Zhijun ;
Jiang, Xiaoyan ;
Huq, Kazi Mohammed Saidul .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (02) :2158-2168
[5]   PCT: Point cloud transformer [J].
Guo, Meng-Hao ;
Cai, Jun-Xiong ;
Liu, Zheng-Ning ;
Mu, Tai-Jiang ;
Martin, Ralph R. ;
Hu, Shi-Min .
COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) :187-199
[6]   FatNet: A Feature-attentive Network for 3D Point Cloud Processing [J].
Kaul, Chaitanya ;
Pears, Nick ;
Manandhar, Suresh .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :7211-7218
[7]   Multi-view convolutional vision transformer for 3D object recognition [J].
Li, Jie ;
Liu, Zhao ;
Li, Li ;
Lin, Junqin ;
Yao, Jian ;
Tu, Jingmin .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
[8]  
Li YY, 2018, ADV NEUR IN, V31
[9]   PQA-Net: Deep No Reference Point Cloud Quality Assessment via Multi-View Projection [J].
Liu, Qi ;
Yuan, Hui ;
Su, Honglei ;
Liu, Hao ;
Wang, Yu ;
Yang, Huan ;
Hou, Junhui .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) :4645-4660
[10]   Relation-Shape Convolutional Neural Network for Point Cloud Analysis [J].
Liu, Yongcheng ;
Fan, Bin ;
Xiang, Shiming ;
Pan, Chunhong .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8887-8896