Weakly Supervised Point Clouds Transformer for 3D Object Detection

被引:1
作者
Tang, Zuojin [1 ,2 ]
Sun, Bo [2 ]
Ma, Tongwei [3 ]
Li, Daosheng [3 ]
Xu, Zhenhui [3 ]
机构
[1] Southeast Univ, Coll Software Engn, Suzhou 215123, Peoples R China
[2] Chinese Acad Sci, Quanzhou Inst Equipment Mfg, Haixi Inst, Quanzhou 362000, Peoples R China
[3] Xinjiang Univ, Coll Mech Engn, Urumqi 830047, Peoples R China
来源
2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC) | 2022年
关键词
D O I
10.1109/ITSC55140.2022.9921926
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The annotation of 3D datasets is required for semantic-segmentation and object detection in scene understanding. In this paper we present a framework for the weakly supervision of a point clouds transformer that is used for 3D object detection. The aim is to decrease the required amount of supervision needed for training, as a result of the high cost of annotating a 3D datasets. We propose an Unsupervised Voting Proposal Module, which learns randomly preset anchor points and uses voting network to select prepared anchor points of high quality. Then it distills information into student and teacher network. In terms of student network, we apply ResNet network to efficiently extract local characteristics. However, it also can lose much global information. To provide the input which incorporates the global and local information as the input of student networks, we adopt the self-attention mechanism of transformer to extract global features, and the ResNet layers to extract region proposals. The teacher network supervises the classification and regression of the student network using the pre-trained model on ImageNet. On the challenging KITTI datasets, the experimental results have achieved the highest level of average precision compared with the most recent weakly supervised 3D object detectors.
引用
收藏
页码:3948 / 3955
页数:8
相关论文
共 50 条
[21]   Boosting 3D Object Detection by Simulating Multimodality on Point Clouds [J].
Zheng, Wu ;
Hong, Mingxuan ;
Jiang, Li ;
Fu, Chi-Wing .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :13628-13637
[22]   Learning Deformable Network for 3D Object Detection on Point Clouds [J].
Zhang, Wanyi ;
Fu, Xiuhua ;
Li, Wei .
MOBILE INFORMATION SYSTEMS, 2021, 2021
[23]   Optimisation of the PointPillars network for 3D object detection in point clouds [J].
Stanisz, Joanna ;
Lis, Konrad ;
Kryjak, Tomasz ;
Gorgon, Marek .
2020 SIGNAL PROCESSING - ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2020, :122-127
[24]   3D Object Detection Algorithm Based on Raw Point Clouds [J].
Zhang, Dongdong ;
Guo, Jie ;
Chen, Yang .
Computer Engineering and Applications, 2024, 59 (03) :209-217
[25]   Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection [J].
Gao, Hongzhi ;
Chen, Zheng ;
Chen, Zehui ;
Chen, Lin ;
Liu, Jiaming ;
Zhang, Shanghang ;
Zhao, Feng .
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 3, 2024, :1797-1805
[26]   Boundary points guided 3D object detection for point clouds [J].
Tang, Qingsong ;
Yang, Mingzhi ;
Wang, Ziyi ;
Dong, Wenhao ;
Liu, Yang .
APPLIED SOFT COMPUTING, 2024, 165
[27]   Enhanced Vote Network for 3D Object Detection in Point Clouds [J].
Zhong, Min ;
Zeng, Gang .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :6624-6631
[28]   A robust scheme for copy detection of 3D object point clouds [J].
Yang, Jiaqi ;
Lu, Xuequan ;
Chen, Wenzhi .
NEUROCOMPUTING, 2022, 510 :181-192
[29]   Relation Graph Network for 3D Object Detection in Point Clouds [J].
Feng, Mingtao ;
Gilani, Syed Zulqarnain ;
Wang, Yaonan ;
Zhang, Liang ;
Mian, Ajmal .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :92-107
[30]   Density Based Clustering for 3D Object Detection in Point Clouds [J].
Ahmed, Syeda Mariam ;
Meng, Chew Chee .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10605-10614