Weakly Supervised Point Clouds Transformer for 3D Object Detection

Cited by: 1
Authors
Tang, Zuojin [1, 2]
Sun, Bo [2]
Ma, Tongwei [3]
Li, Daosheng [3]
Xu, Zhenhui [3]
Affiliations
[1] Southeast Univ, Coll Software Engn, Suzhou 215123, Peoples R China
[2] Chinese Acad Sci, Quanzhou Inst Equipment Mfg, Haixi Inst, Quanzhou 362000, Peoples R China
[3] Xinjiang Univ, Coll Mech Engn, Urumqi 830047, Peoples R China
Source
2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC) | 2022
DOI
10.1109/ITSC55140.2022.9921926
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Annotated 3D datasets are required for semantic segmentation and object detection in scene understanding. In this paper, we present a framework for weakly supervised training of a point cloud transformer for 3D object detection. The aim is to reduce the amount of supervision needed for training, given the high cost of annotating 3D datasets. We propose an Unsupervised Voting Proposal Module, which learns randomly preset anchor points and uses a voting network to select high-quality anchor points. It then distills this information into the student and teacher networks. In the student network, we apply ResNet to efficiently extract local features; however, ResNet alone can lose much of the global information. To give the student network an input that incorporates both global and local information, we adopt the self-attention mechanism of the transformer to extract global features and ResNet layers to extract region proposals. The teacher network, using a model pre-trained on ImageNet, supervises the classification and regression of the student network. On the challenging KITTI dataset, the method achieves the highest average precision among the most recent weakly supervised 3D object detectors.
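The abstract's idea of pairing local features with global features obtained by self-attention can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes plain scaled dot-product self-attention with identity query/key/value projections (a real transformer layer learns these projections), treats hand-written 2D vectors as stand-ins for per-point local features, and uses the hypothetical helper names `self_attention` and `fuse_global_local`.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(features):
    """Scaled dot-product self-attention with identity Q/K/V projections.

    Assumption: no learned weights, purely for illustration.
    features: list of N feature vectors, each of length d.
    Returns N vectors; each is a weighted average of all N inputs,
    so every output carries information from the whole point set.
    """
    d = len(features[0])
    scale = math.sqrt(d)
    outputs = []
    for q in features:
        # Similarity of this point to every point (including itself).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in features]
        weights = softmax(scores)
        out = [sum(w * v[j] for w, v in zip(weights, features)) for j in range(d)]
        outputs.append(out)
    return outputs


def fuse_global_local(local_feats):
    """Concatenate each local feature with its attention-derived global feature."""
    global_feats = self_attention(local_feats)
    return [loc + glob for loc, glob in zip(local_feats, global_feats)]


# Hypothetical local features for three points.
points = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fused = fuse_global_local(points)
```

Each fused vector has length `2 * d`: the first half is the unchanged local feature and the second half is a convex combination of all point features, which is one simple way to give a downstream network both kinds of information at once.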
Pages: 3948-3955
Page count: 8
Related papers
50 in total
  • [1] Weakly Supervised 3D Object Detection from Point Clouds
    Qin, Zengyi
    Wang, Jinglu
    Lu, Yan
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4144 - 4152
  • [2] SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds
    Sun, Pei
    Tan, Mingxing
    Wang, Weiyue
    Liu, Chenxi
    Xia, Fei
    Leng, Zhaoqi
    Anguelov, Dragomir
    COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 426 - 442
  • [3] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection
    Zhang, Dingyuan
    Liang, Dingkang
    Zou, Zhikang
    Li, Jingyu
    Ye, Xiaoqing
    Liu, Zhe
    Tan, Xiao
    Bai, Xiang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8339 - 8349
  • [4] Towards a Weakly Supervised Framework for 3D Point Cloud Object Detection and Annotation
    Meng, Qinghao
    Wang, Wenguan
    Zhou, Tianfei
    Shen, Jianbing
    Jia, Yunde
    Van Gool, Luc
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4454 - 4468
  • [5] Clusterformer: Cluster-based Transformer for 3D Object Detection in Point Clouds
    Pei, Yu
    Zhao, Xian
    Li, Hao
    Ma, Jingyuan
    Zhang, Jingwei
    Pu, Shiliang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6641 - 6650
  • [6] GECNN for Weakly Supervised Semantic Segmentation of 3D Point Clouds
    He, Zifen
    Zhu, Shouye
    Huang, Ying
    Zhang, Yinhui
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (12) : 2237 - 2243
  • [7] DVST: Deformable Voxel Set Transformer for 3D Object Detection from Point Clouds
    Ning, Yaqian
    Cao, Jie
    Bao, Chun
    Hao, Qun
    REMOTE SENSING, 2023, 15 (23)
  • [8] Transformer for 3D Point Clouds
    Wang, Jiayun
    Chakraborty, Rudrasis
    Yu, Stella X.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4419 - 4431
  • [9] Self Supervised Learning for Multiple Object Tracking in 3D Point Clouds
    Kumar, Aakash
    Kini, Jyoti
    Mian, Ajmal
    Shah, Mubarak
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 3754 - 3761
  • [10] MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds
    Dong, Shaocong
    Ding, Lihe
    Wang, Haiyang
    Xu, Tingfa
    Xu, Xinli
    Bian, Ziyang
    Wang, Ying
    Wang, Jie
    Li, Jianan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,