Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection

Cited by: 0
Authors
Ma, Xinzhu [1, 2]
Wang, Yongtao [3 ]
Zhang, Yinmin [1, 2]
Xia, Zhiyi [4 ]
Meng, Yuan [4 ]
Wang, Zhihui [3 ]
Li, Haojie [3 ]
Ouyang, Wanli [1 ]
Affiliations
[1] Shanghai AI Lab, Shanghai, People's Republic of China
[2] University of Sydney, Sydney, NSW, Australia
[3] Dalian University of Technology, Dalian, People's Republic of China
[4] Tsinghua University, Beijing, People's Republic of China
Funding
National Natural Science Foundation of China
DOI
10.1109/ICCV51070.2023.00591
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we build a modular codebase, formulate strong training recipes, design an error diagnosis toolbox, and discuss current methods for image-based 3D object detection. Unlike highly mature tasks such as 2D object detection, image-based 3D object detection is still evolving, and methods often adopt different training recipes and tricks, which leads to unfair evaluations and comparisons. Worse, the gains from these tricks may outweigh those of the proposed designs, even leading to wrong conclusions. To address this issue, we build a modular codebase and formulate unified training standards for the community. We also design an error diagnosis toolbox to characterize detection models in detail. Using these tools, we analyze current methods in depth under varying settings and discuss some open questions, e.g., the discrepancies in conclusions between the KITTI-3D and nuScenes datasets, which have led to different dominant methods on each dataset. We hope that this work will facilitate future research in image-based 3D object detection. Our code will be released at https://github.com/OpenGVLab/3dodi.
Pages: 6402-6412
Page count: 11
Related papers
50 in total
  • [1] Sequential Image-based 3D Object Detection with Location Refinement
    Sim, Sangmin
    Kim, Youngseok
    Kum, Dongsuk
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3625 - 3631
  • [2] Star-Convolution for Image-Based 3D Object Detection
    Liu, Yuxuan
    Xu, Zhenhua
    Liu, Ming
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5018 - 5024
  • [3] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
    Qian, Rui
    Garg, Divyansh
    Wang, Yan
    You, Yurong
    Belongie, Serge
    Hariharan, Bharath
    Campbell, Mark
    Weinberger, Kilian Q.
    Chao, Wei-Lun
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5880 - 5889
  • [4] 3D image-based stereology
    Adachi, Yoshitaka
    Ojima, Mayumi
    Sato, Naoko
    Wang, Yuan Tsung
    THERMEC 2011, PTS 1-4, 2012, 706-709 : 2687 - +
  • [5] Image-based extraction of material reflectance properties of a 3D rigid object
    Erdem, ME
    Erdem, IA
    Yilmaz, UG
    Atalay, V
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 245 - 248
  • [6] An Image-based System for Sharing a 3D Object by Transmitting to Remote Locations
    Kikukawa, Tetsuya
    Kitamura, Yoshifumi
    Ohno, Tsubasa
    Sakurai, Satoshi
    Yamaguchi, Tokuo
    Kishino, Fumio
    Kunita, Yutaka
    Isogai, Megumi
    Kimata, Hideaki
    Matsuura, Norihiko
    IEEE VIRTUAL REALITY 2010, PROCEEDINGS, 2010, : 277 - +
  • [7] 3D object detection based on synthetic RGB image
    Xu C.
    Li Z.
    Jiang D.
    Yun J.
    Liu Y.
    Liu Y.
    Bai D.
    Ying S.
    International Journal of Wireless and Mobile Computing, 2021, 20 (01): : 70 - 76
  • [8] Hybrid image-based collision detection in Java 3D
    Xiao, Gaoyu
    Aziz, Aamer
    Nowinski, Wieslaw L.
    SOFTWARE-PRACTICE & EXPERIENCE, 2007, 37 (09): : 963 - 982
  • [9] Towards Stable 3D Object Detection
    Wang, Jiabao
    Meng, Qiang
    Liu, Guochao
    Yang, Liujiang
    Wang, Ke
    Cheng, Ming-Ming
    Hou, Qibin
    COMPUTER VISION - ECCV 2024, PT L, 2025, 15108 : 197 - 213
  • [10] Vulnerability of Feature Extractors in 2D Image-Based 3D Object Retrieval
    Liu, An-An
    Zhou, He-Yu
    Li, Xuanya
    Wang, Lanjun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5065 - 5076