LFT-Net: Local Feature Transformer Network for Point Clouds Analysis

被引：61

作者：

Gao, Yongbin ^{[1
]}

Liu, Xuebing ^{[1
]}

Li, Jun ^{[2
]}

Fang, Zhijun ^{[1
]}

Jiang, Xiaoyan ^{[1
]}

Huq, Kazi Mohammed Saidul ^{[3
]}

机构：

[1] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China

[2] Guangzhou Univ, Sch Elect & Commun Engn, Res Ctr Intelligent Commun Engn, Guangzhou 510006, Peoples R China

[3] Univ South Wales, Fac Comp Engn & Sci, Pontypridd CF37 1DL, M Glam, Wales

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023年 / 24卷 / 02期

关键词：

Point cloud compression; Transformers; Three-dimensional displays; Task analysis; Feature extraction; Convolution; Semantics; 6G; point cloud; 3D computer vision; transfomer; classification; segmentation;

D O I：

10.1109/TITS.2022.3140355

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

6G network enables the rapid connection of autonomous vehicles, the generated internet of vehicles establishes a large-scale point cloud, which requires automatic point cloud analysis to build an intelligent transportation system in terms of the 3D object detection and segmentation. Recently, a great variety of deep convolution networks have been proposed for 3D data analysis, making significant progress in the application of deep learning in 3D computer vision. Inspired by the application of transformer network in 2D computer visual tasks, and in order to increase the expression ability of local fine-grained features, we propose an effective local feature transformer network to learn local feature information and correlations between point clouds. Our network is adaptive to the arrangement of set elements through transformer module, so it is suitable for the feature extraction of local point clouds. In addition, experimental results demonstrate that our LFT-network outperforms the state-of-the-art in 3D model classification tasks on ModelNet40 dataset and segmentation tasks on S3DIS dataset.

引用

页码：2158 / 2168

页数：11

共 56 条

[1] Energy Efficient Resource Allocation in D2D-Assisted Heterogeneous Networks with Relays [J].

Ali, Mudassar ;

Qaisar, Saad ;

Naeem, Muhammad ;

Mumtaz, Shahid .

IEEE ACCESS, 2016, 4 :4902-4911

[2]

[Anonymous], 2015, PROC CVPR IEEE

[3]

[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.170

[4] Survey on the Internet of Vehicles: Network Architectures and Applications [J].

Ji B. ;

Zhang X. ;

Mumtaz S. ;

Han C. ;

Li C. ;

Wen H. ;

Wang D. .

IEEE Communications Standards Magazine, 2020, 4 (01) :34-41

[5]

Boulch A., 2017, 3DOR, V3, P17

[6] End-to-End Object Detection with Transformers [J].

Carion, Nicolas ;

Massa, Francisco ;

Synnaeve, Gabriel ;

Usunier, Nicolas ;

Kirillov, Alexander ;

Zagoruyko, Sergey .

COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229

[7] Pre-Trained Image Processing Transformer [J].

Chen, Hanting ;

Wang, Yunhe ;

Guo, Tianyu ;

Xu, Chang ;

Deng, Yiping ;

Liu, Zhenhua ;

Ma, Siwei ;

Xu, Chunjing ;

Xu, Chao ;

Gao, Wen .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305

[8]

Chen L.-Z., 2019, ARXIV190505442

[9]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[10]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

← 1 2 3 4 5 6 →