Weakly Supervised Point Clouds Transformer for 3D Object Detection

Cited by: 1

Authors:

Tang, Zuojin [1,2]

Sun, Bo [2]

Ma, Tongwei [3]

Li, Daosheng [3]

Xu, Zhenhui [3]

Affiliations:

[1] Southeast Univ, Coll Software Engn, Suzhou 215123, Peoples R China

[2] Chinese Acad Sci, Quanzhou Inst Equipment Mfg, Haixi Inst, Quanzhou 362000, Peoples R China

[3] Xinjiang Univ, Coll Mech Engn, Urumqi 830047, Peoples R China

Source:

2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC) | 2022

Keywords:

DOI:

10.1109/ITSC55140.2022.9921926

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Annotated 3D datasets are required for semantic segmentation and object detection in scene understanding. In this paper we present a framework for weak supervision of a point cloud transformer used for 3D object detection. The aim is to reduce the amount of supervision needed for training, because annotating 3D datasets is costly. We propose an Unsupervised Voting Proposal Module, which learns randomly preset anchor points and uses a voting network to select high-quality anchor points. It then distills information into the student and teacher networks. In the student network, we apply a ResNet to extract local features efficiently; however, this can lose much of the global information. To give the student network an input that combines global and local information, we adopt the transformer's self-attention mechanism to extract global features and ResNet layers to extract region proposals. The teacher network, which uses a model pre-trained on ImageNet, supervises the classification and regression of the student network. On the challenging KITTI dataset, the method achieves the highest average precision among the most recent weakly supervised 3D object detectors.
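As an illustration of how self-attention lets every point draw on global context, the following is a minimal sketch of scaled dot-product self-attention over a handful of points. It is not the authors' implementation: it assumes identity query/key/value projections (a real transformer layer learns these), uses raw xyz coordinates as the features, and the names `softmax`, `self_attention`, `points`, and `global_feats` are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(features):
    """Scaled dot-product self-attention.

    Assumption: queries, keys, and values are the input features
    themselves (identity projections), purely for illustration.
    """
    d = len(features[0])
    scale = math.sqrt(d)
    out = []
    for q in features:
        # Similarity of this point to every point in the cloud.
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in features]
        weights = softmax(scores)
        # Each output feature is a weighted mix of all points' features.
        out.append(
            [sum(w * v[j] for w, v in zip(weights, features)) for j in range(d)]
        )
    return out


# Toy point cloud: four points with xyz coordinates as features.
points = [
    [0.0, 0.0, 0.0],
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [5.0, 5.0, 5.0],
]
global_feats = self_attention(points)
```

Each output row mixes information from all points, in contrast to a convolutional layer, which only sees a local neighbourhood. The origin has zero similarity to every point, so its attention weights are uniform and its output is the mean of the cloud.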

Pages: 3948-3955

Page count: 8

50 items in total

[11] 3D Siamese Transformer Network for Single Object Tracking on Point Clouds [J].

Hui, Le ;

Wang, Lingpeng ;

Tang, Linghua ;

Lan, Kaihao ;

Xie, Jin ;

Yang, Jian .

COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 :293-310

[12] Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds [J].

Wei, Jiacheng ;

Lin, Guosheng ;

Yap, Kim-Hui ;

Liu, Fayao ;

Hung, Tzu-Yi .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) :4367-4377

[13] BSTS: A Weakly-Supervised Method for Semantic Learning of 3D Point Clouds [J].

Liu, Yan ;

Hu, Qingyong ;

Guo, Yulan .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) :11386-11399

[14] A weakly supervised method for 3D object detection with partially annotated samples [J].

Lu, Bin ;

Li, Qing ;

Liang, Yanju .

MEASUREMENT & CONTROL, 2025, 58 (04) :525-537

[15] A weakly supervised method for 3D object detection with partially annotated samples [J].

Lu, Bin ;

Li, Qing ;

Liang, Yanju .

Measurement and Control (United Kingdom), 2025, 58 (04) :525-537

[16] General Geometry-Aware Weakly Supervised 3D Object Detection [J].

Zhang, Guowen ;

Fan, Junsong ;

Chen, Liyi ;

Zhang, Zhaoxiang ;

Lei, Zhen ;

Zhang, Lei .

COMPUTER VISION - ECCV 2024, PT LI, 2025, 15109 :290-309

[17] Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection [J].

Gao, Hongzhi ;

Chen, Zheng ;

Chen, Zehui ;

Chen, Lin ;

Liu, Jiaming ;

Zhang, Shanghang ;

Zhao, Feng .

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 3, 2024, :1797-1805

[18] Knowledge guided object detection and identification in 3D Point Clouds [J].

Karmacharya, A. ;

Boochs, F. ;

Tietz, B. .

VIDEOMETRICS, RANGE IMAGING, AND APPLICATIONS XIII, 2015, 9528

[19] Deep Hough Voting for 3D Object Detection in Point Clouds [J].

Qi, Charles R. ;

Litany, Or ;

He, Kaiming ;

Guibas, Leonidas J. .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9276-9285

[20] 3D Object Detection with Normal-map on Point Clouds [J].

Miao, Jishu ;

Hirakawa, Tsubasa ;

Yamashita, Takayoshi ;

Fujiyoshi, Hironobu .

VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, :569-576
