Semi-Supervised Stereo-based 3D Object Detection via Cross-View Consensus

被引:2
|
作者
Wu, Wenhao [1 ]
Wong, Hau-San [1 ]
Wu, Si [2 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01676
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stereo-based 3D object detection, which aims at detecting 3D objects with stereo cameras, shows great potential in low-cost deployment compared to LiDAR-based methods and excellent performance compared to monocular-based algorithms. However, the impressive performance of stereo-based 3D object detection is at the huge cost of high-quality manual annotations, which are hardly attainable for any given scene. Semi-supervised learning, in which limited annotated data and numerous unannotated data are required to achieve a satisfactory model, is a promising method to address the problem of data deficiency. In this work, we propose to achieve semi-supervised learning for stereo-based 3D object detection through pseudo annotation generation from a temporal-aggregated teacher model, which temporally accumulates knowledge from a student model. To facilitate a more stable and accurate depth estimation, we introduce Temporal-Aggregation-Guided (TAG) disparity consistency, a cross-view disparity consistency constraint between the teacher model and the student model for robust and improved depth estimation. To mitigate noise in pseudo annotation generation, we propose a cross-view agreement strategy, in which pseudo annotations should attain high degree of agreements between 3D and 2D views, as well as between binocular views. We perform extensive experiments on the KITTI 3D dataset to demonstrate our proposed method's capability in leveraging a huge amount of unannotated stereo images to attain significantly improved detection results.
引用
收藏
页码:17471 / 17481
页数:11
相关论文
共 50 条
  • [41] SSDA3D: Semi-supervised Domain Adaptation for 3D Object Detection from Point Cloud
    Wang, Yan
    Yin, Junbo
    Li, Wei
    Frossard, Pascal
    Yang, Ruigang
    Shen, Jianbing
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2707 - 2715
  • [42] SEMI-SUPERVISED 3D HAND-OBJECT POSE ESTIMATION VIA POSE DICTIONARY LEARNING
    Cheng, Zida
    Chen, Siheng
    Zhang, Ya
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3632 - 3636
  • [43] Investigation of stereo-based 3D surface reconstruction
    Hemayed, EE
    Sandbek, A
    Wassal, AG
    Farag, AA
    THREE-DIMENSIONAL IMAGE CAPTURE, 1997, 3023 : 191 - 202
  • [44] ProUDA: Progressive unsupervised data augmentation for semi-Supervised 3D object detection on point cloud
    An, Pei
    Liang, Junxiong
    Ma, Tao
    Chen, Yanfei
    Wang, Liheng
    Ma, Jie
    PATTERN RECOGNITION LETTERS, 2023, 170 : 64 - 69
  • [45] 3D Model Annotation based on Semi-Supervised Learning
    Zhou, Kai
    Tian, Feng
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2014, 14 (08): : 9 - 13
  • [46] Semi-Supervised 3D Shape Segmentation via Self Refining
    Shu, Zhenyu
    Wu, Teng
    Shen, Jiajun
    Xin, Shiqing
    Liu, Ligang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2044 - 2057
  • [47] An Alternating Guidance With Cross-View TeacherStudent Framework for Remote Sensing Semi-Supervised Semantic Segmentation
    Fu, Yujia
    Wang, Mingyang
    Vivone, Gemine
    Ding, Yunhong
    Zhang, Lin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [48] STEREO-BASED 3D SPACE HANDWRITING RECOGNITION
    Chen, Ying-Nong
    Chuan, Chi-Hung
    Fan, Kuo-Chin
    2018 32ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2018, : 615 - 617
  • [49] Recursive Cross-View: Use Only 2D Detectors to Achieve 3D Object Detection Without 3D Annotations
    Shun, Gui
    Yan, Luximon
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6659 - 6666
  • [50] FocalMix: Semi-Supervised Learning for 3D Medical Image Detection
    Wang, Dong
    Zhang, Yuan
    Zhang, Kexin
    Wang, Liwei
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3950 - 3959