Semi-Supervised Stereo-based 3D Object Detection via Cross-View Consensus

Cited by: 2
Authors
Wu, Wenhao [1]
Wong, Hau-San [1]
Wu, Si [2]
Affiliations
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
Funding
National Natural Science Foundation of China
DOI
10.1109/CVPR52729.2023.01676
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Stereo-based 3D object detection, which aims to detect 3D objects with stereo cameras, offers lower-cost deployment than LiDAR-based methods and better performance than monocular algorithms. However, this performance depends on large amounts of high-quality manual annotation, which is hard to obtain for arbitrary scenes. Semi-supervised learning, which trains a model from limited annotated data together with abundant unannotated data, is a promising way to address this data deficiency. In this work, we propose semi-supervised learning for stereo-based 3D object detection through pseudo annotation generation from a temporal-aggregated teacher model, which temporally accumulates knowledge from a student model. For more stable and accurate depth estimation, we introduce Temporal-Aggregation-Guided (TAG) disparity consistency, a cross-view disparity consistency constraint between the teacher model and the student model. To mitigate noise in pseudo annotation generation, we propose a cross-view agreement strategy, in which pseudo annotations must attain a high degree of agreement between 3D and 2D views, as well as between binocular views. Extensive experiments on the KITTI 3D dataset demonstrate that the proposed method can leverage a large amount of unannotated stereo images to attain significantly improved detection results.
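The abstract does not say how the teacher accumulates knowledge or how agreement between views is scored. As an illustration only, the sketch below assumes two common choices: an exponential moving average (EMA) of student weights for the teacher, and 2D intersection-over-union (IoU) between a projected 3D box and a detected 2D box as the agreement score. The function names, the momentum value, and the `min_iou` threshold are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Tuple

Box2D = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels


def ema_update(teacher: Dict[str, float], student: Dict[str, float],
               momentum: float = 0.999) -> Dict[str, float]:
    """Assumed teacher update: move each teacher weight toward the student."""
    return {name: momentum * teacher[name] + (1.0 - momentum) * student[name]
            for name in teacher}


def iou_2d(a: Box2D, b: Box2D) -> float:
    """Intersection-over-union of two axis-aligned 2D boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def filter_by_agreement(projected: List[Box2D], detected: List[Box2D],
                        min_iou: float = 0.7) -> List[int]:
    """Indices of pseudo annotations whose projected 3D box (hypothetical
    input, already projected to the image plane) agrees with its paired
    2D detection. The pairing and the threshold are assumptions."""
    return [i for i, (p, d) in enumerate(zip(projected, detected))
            if iou_2d(p, d) >= min_iou]
```

In this sketch, a pseudo annotation whose projected box does not overlap its 2D detection is dropped before it reaches the student. The same scoring could be applied between left-view and right-view boxes to approximate the binocular agreement the abstract mentions.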
Pages: 17471-17481
Page count: 11