LFT-Net: Local Feature Transformer Network for Point Clouds Analysis

被引:56
作者
Gao, Yongbin [1 ]
Liu, Xuebing [1 ]
Li, Jun [2 ]
Fang, Zhijun [1 ]
Jiang, Xiaoyan [1 ]
Huq, Kazi Mohammed Saidul [3 ]
机构
[1] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China
[2] Guangzhou Univ, Sch Elect & Commun Engn, Res Ctr Intelligent Commun Engn, Guangzhou 510006, Peoples R China
[3] Univ South Wales, Fac Comp Engn & Sci, Pontypridd CF37 1DL, M Glam, Wales
关键词
Point cloud compression; Transformers; Three-dimensional displays; Task analysis; Feature extraction; Convolution; Semantics; 6G; point cloud; 3D computer vision; transfomer; classification; segmentation;
D O I
10.1109/TITS.2022.3140355
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
6G network enables the rapid connection of autonomous vehicles, the generated internet of vehicles establishes a large-scale point cloud, which requires automatic point cloud analysis to build an intelligent transportation system in terms of the 3D object detection and segmentation. Recently, a great variety of deep convolution networks have been proposed for 3D data analysis, making significant progress in the application of deep learning in 3D computer vision. Inspired by the application of transformer network in 2D computer visual tasks, and in order to increase the expression ability of local fine-grained features, we propose an effective local feature transformer network to learn local feature information and correlations between point clouds. Our network is adaptive to the arrangement of set elements through transformer module, so it is suitable for the feature extraction of local point clouds. In addition, experimental results demonstrate that our LFT-network outperforms the state-of-the-art in 3D model classification tasks on ModelNet40 dataset and segmentation tasks on S3DIS dataset.
引用
收藏
页码:2158 / 2168
页数:11
相关论文
共 56 条
  • [1] Energy Efficient Resource Allocation in D2D-Assisted Heterogeneous Networks with Relays
    Ali, Mudassar
    Qaisar, Saad
    Naeem, Muhammad
    Mumtaz, Shahid
    [J]. IEEE ACCESS, 2016, 4 : 4902 - 4911
  • [2] [Anonymous], 2015, P IEEE C COMPUTER VI, DOI [10.1109/CVPR.2015.7298801, DOI 10.1109/CVPR.2015.7298801]
  • [3] [Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.170
  • [4] Survey on the Internet of Vehicles: Network Architectures and Applications
    Ji B.
    Zhang X.
    Mumtaz S.
    Han C.
    Li C.
    Wen H.
    Wang D.
    [J]. IEEE Communications Standards Magazine, 2020, 4 (01): : 34 - 41
  • [5] Boulch A., 2017, 3dor@eurographics, V3, P17, DOI [10.2312/3dor.20171047, DOI 10.2312/3DOR.20171047]
  • [6] Carion N, 2020, European conference on computer vision, P213, DOI DOI 10.1007/978-3-030-58452-813
  • [7] Pre-Trained Image Processing Transformer
    Chen, Hanting
    Wang, Yunhe
    Guo, Tianyu
    Xu, Chang
    Deng, Yiping
    Liu, Zhenhua
    Ma, Siwei
    Xu, Chunjing
    Xu, Chao
    Gao, Wen
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12294 - 12305
  • [8] Chen L.-Z., 2019, ARXIV190505442
  • [9] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [10] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171