Monocular 3D Object Detection Utilizing Auxiliary Learning With Deformable Convolution

被引:2
作者
Chen, Jiun-Han [1 ]
Shieh, Jeng-Lun [1 ]
Haq, Muhamad Amirul [1 ]
Ruan, Shanq-Jang [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Elect & Comp Engn, Taipei 10607, Taiwan
关键词
Three-dimensional displays; Object detection; Solid modeling; Feature extraction; Training; Computational modeling; Task analysis; 3D object detection; monocular camera; driving scene understanding; auxiliary learning; deep learning;
D O I
10.1109/TITS.2023.3319556
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In autonomous driving systems, the monocular 3D object detection algorithm is a crucial component. The safety of autonomous vehicles heavily depends on a well-designed detection system. Therefore, developing a robust and efficient 3D object detection algorithm is a major goal for institutes and researchers. Having a 3D sense is essential in autonomous vehicles and robotics, as it allows the system to understand its surroundings and react accordingly. Compared with stereo-based and Lidar-based methods, monocular 3D Object detection is a challenging task as it only utilizes 2D information to generate complex 3D features, making it low-cost, less computationally intensive, and with great potential. However, the performance of monocular methods is impaired due to the lack of depth information. In this paper, we propose a simple, end-to-end, and effective network for monocular 3D object detection without the use of external training data. Our work is inspired by auxiliary learning, in which we use a robust feature extractor as our backbone and multiple regression heads to learn auxiliary knowledge. These auxiliary regression heads will be discarded after training for improved inference efficiency, allowing us to take advantage of auxiliary learning and enabling the model to learn critical information more conceptually. The proposed method achieves 17.28% and 20.10% for the moderate level of the Car category on the KITTI benchmark test set and validation set, respectively, which outperforms the previous monocular 3D object detection approaches.
引用
收藏
页码:2424 / 2436
页数:13
相关论文
共 50 条
  • [1] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [2] Shape-Aware Monocular 3D Object Detection
    Chen, Wei
    Zhao, Jie
    Zhao, Wan-Lei
    Wu, Song-Yuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
  • [3] Monocular 3D Object Detection With Motion Feature Distillation
    Hu, Henan
    Li, Muyu
    Zhu, Ming
    Gao, Wen
    Liu, Peiyu
    Chan, Kwok-Leung
    IEEE ACCESS, 2023, 11 : 82933 - 82945
  • [4] MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection
    Qiao, Junchao
    Liu, Biao
    Yang, Jiaqi
    Wang, Baohua
    Xiu, Sanmu
    Du, Xin
    Nie, Xiaobo
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (08): : 7326 - 7332
  • [5] MonoMPV: Monocular 3D Object Detection With Multiple Projection Views on Edge Devices
    Deng, Zhaoxue
    Hao, Bingsen
    Liu, Guofang
    Li, Xingquan
    Wei, Hanbing
    Huang, Fei
    Liu, Shengshu
    IEEE ACCESS, 2024, 12 : 136599 - 136612
  • [6] One Stage Monocular 3D Object Detection Utilizing Discrete Depth and Orientation Representation
    Haq, Muhamad Amirul
    Ruan, Shanq-Jang
    Shao, Mei-En
    ul Haq, Qazi Mazhar
    Liang, Pei-Jung
    Gao, De-Qin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21630 - 21640
  • [7] GUPNet plus plus : Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
    Lu, Yan
    Ma, Xinzhu
    Yang, Lei
    Zhang, Tianzhu
    Liu, Yating
    Chu, Qi
    He, Tong
    Li, Yonghui
    Ouyang, Wanli
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (02) : 900 - 915
  • [8] Constructing 3D Object Detectors Based on Deformable Convolutional Guided Depths
    Zheng, Xinwang
    Yang, Guangsong
    Yang, Lu
    Lu, Chengyu
    IEEE ACCESS, 2024, 12 : 162990 - 163000
  • [9] SSD-MonoDETR: Supervised Scale-Aware Deformable Transformer for Monocular 3D Object Detection
    He, Xuan
    Yang, Fan
    Yang, Kailun
    Lin, Jiacheng
    Fu, Haolong
    Wang, Meng
    Yuan, Jin
    Li, Zhiyong
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 555 - 567
  • [10] MonoSIM: Simulating Learning Behaviors of Heterogeneous Point Cloud Object Detectors for Monocular 3-D Object Detection
    Sun, Han
    Fan, Zhaoxin
    Song, Zhenbo
    Wang, Zhicheng
    Wu, Kejian
    Lu, Jianfeng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73