Semi-Supervised Stereo-based 3D Object Detection via Cross-View Consensus

被引:2
|
作者
Wu, Wenhao [1 ]
Wong, Hau-San [1 ]
Wu, Si [2 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01676
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stereo-based 3D object detection, which aims at detecting 3D objects with stereo cameras, shows great potential in low-cost deployment compared to LiDAR-based methods and excellent performance compared to monocular-based algorithms. However, the impressive performance of stereo-based 3D object detection is at the huge cost of high-quality manual annotations, which are hardly attainable for any given scene. Semi-supervised learning, in which limited annotated data and numerous unannotated data are required to achieve a satisfactory model, is a promising method to address the problem of data deficiency. In this work, we propose to achieve semi-supervised learning for stereo-based 3D object detection through pseudo annotation generation from a temporal-aggregated teacher model, which temporally accumulates knowledge from a student model. To facilitate a more stable and accurate depth estimation, we introduce Temporal-Aggregation-Guided (TAG) disparity consistency, a cross-view disparity consistency constraint between the teacher model and the student model for robust and improved depth estimation. To mitigate noise in pseudo annotation generation, we propose a cross-view agreement strategy, in which pseudo annotations should attain high degree of agreements between 3D and 2D views, as well as between binocular views. We perform extensive experiments on the KITTI 3D dataset to demonstrate our proposed method's capability in leveraging a huge amount of unannotated stereo images to attain significantly improved detection results.
引用
收藏
页码:17471 / 17481
页数:11
相关论文
共 50 条
  • [21] CrossDTR: Cross-view and Depth-guided Transformers for 3D Object Detection
    Tseng, Ching-Yu
    Chen, Yi-Rong
    Lee, Hsin-Ying
    Wu, Tsung-Han
    Chen, Wen-Chin
    Hsu, Winston H.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 4850 - 4857
  • [22] Transferable Semi-Supervised 3D Object Detection From RGB-D Data
    Tang, Yew Siang
    Lee, Gim Hee
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1931 - 1940
  • [23] Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection
    Ho, Cheng-Ju
    Tai, Chen-Hsuan
    Lin, Yen-Yu
    Yang, Ming-Hsuan
    Tsai, Yi-Hsuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [24] Semi-Supervised Online Continual Learning for 3D Object Detection in Mobile Robotics
    Liu, Binhong
    Yao, Dexin
    Yang, Rui
    Yan, Zhi
    Yang, Tao
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (04)
  • [25] Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection
    Zhang, Jiacheng
    Li, Jiaming
    Lin, Xiangru
    Zhang, Wei
    Tang, Xiao
    Hang, Junyu
    Ding, Errui
    Wang, Jingdong
    Li, Guanbin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16923 - 16932
  • [26] Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection
    Liu, Chuandong
    Gao, Chenqiang
    Liu, Fangcen
    Li, Pengcheng
    Meng, Deyu
    Gao, Xinbo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23819 - 23828
  • [27] Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection
    Zeng, Shuai
    Zheng, Wenzhao
    Lu, Jiwen
    Yan, Haibin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9644 - 9656
  • [28] StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-Based 3D Object Detection
    Liu, Zhe
    Ye, Xiaoqing
    Tan, Xiao
    Ding, Errui
    Bai, Xiang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1790 - 1798
  • [29] 3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection
    Wang, He
    Cong, Yezhen
    Litany, Or
    Gao, Yue
    Guibas, Leonidas J.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14610 - 14619
  • [30] Semi-supervised surface object detection based on multi-view cross-consistency learning
    Feng J.
    Li B.
    Tian L.
    Dong C.
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2023, 55 (04): : 107 - 114