Star-Convolution for Image-Based 3D Object Detection

被引:0
|
作者
Liu, Yuxuan [1 ]
Xu, Zhenhua [1 ]
Liu, Ming [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022) | 2022年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICRA46639.2022.9811612
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
3D object detection with only image inputs is an interesting and important problem in computer vision and autonomous driving. Nowadays, most existing monocular 3D object detection algorithms rely solely on the approximation power of convolutional neural networks to learn a mapping from pixels to 3D predictions without knowing the projection matrix of the camera. To endow the networks with camera projection knowledge, we propose the Star-Convolution module for application to image-based 3D detection. The introduced module increases the receptive field of the detector and embeds the camera's projection geometry inside the network while keeping the network end-to-end trainable. We test the module with different baselines in both monocular and stereo 3D object detection, and we achieve significant improvements on both tasks. The code will be published at https://github.com/Owen-Liuyuxuan/visualDet3D.
引用
收藏
页码:5018 / 5024
页数:7
相关论文
共 50 条
  • [1] Sequential Image-based 3D Object Detection with Location Refinement
    Sim, Sangmin
    Kim, Youngseok
    Kum, Dongsuk
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3625 - 3631
  • [2] Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection
    Ma, Xinzhu
    Wang, Yongtao
    Zhang, Yinmin
    Xia, Zhiyi
    Meng, Yuan
    Wang, Zhihui
    Li, Haojie
    Ouyang, Wanli
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6402 - 6412
  • [3] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
    Qian, Rui
    Garg, Divyansh
    Wang, Yan
    You, Yurong
    Belongie, Serge
    Hariharan, Bharath
    Campbell, Mark
    Weinberger, Kilian Q.
    Chao, Wei-Lun
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5880 - 5889
  • [4] MonoDCN: Monocular 3D object detection based on dynamic convolution
    Qu, Shenming
    Yang, Xinyu
    Gao, Yiming
    Liang, Shengbin
    PLOS ONE, 2022, 17 (10):
  • [5] DROP SPARSE CONVOLUTION FOR 3D OBJECT DETECTION
    Zhu, Taohong
    Shen, Jun
    Wang, Chali
    Xiong, Huiyuan
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3185 - 3189
  • [6] To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels
    Chai, Yuning
    Sun, Pei
    Ngiam, Jiquan
    Wang, Weiyue
    Caine, Benjamin
    Vasudevan, Vijay
    Zhang, Xiao
    Anguelov, Dragomir
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15995 - 16004
  • [7] 3D image-based stereology
    Adachi, Yoshitaka
    Ojima, Mayumi
    Sato, Naoko
    Wang, Yuan Tsung
    THERMEC 2011, PTS 1-4, 2012, 706-709 : 2687 - +
  • [8] Image-based extraction of material reflectance properties of a 3D rigid object
    Erdem, ME
    Erdem, IA
    Yilmaz, UG
    Atalay, V
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 245 - 248
  • [9] An Image-based System for Sharing a 3D Object by Transmitting to Remote Locations
    Kikukawa, Tetsuya
    Kitamura, Yoshifumi
    Ohno, Tsubasa
    Sakurai, Satoshi
    Yamaguchi, Tokuo
    Kishino, Fumio
    Kunita, Yutaka
    Isogai, Megumi
    Kimata, Hideaki
    Matsuura, Norihiko
    IEEE VIRTUAL REALITY 2010, PROCEEDINGS, 2010, : 277 - +
  • [10] IoT-based 3D convolution for video salient object detection
    Dong, Shizhou
    Gao, Zhifan
    Pirbhulal, Sandeep
    Bian, Gui-Bin
    Zhang, Heye
    Wu, Wanqing
    Li, Shuo
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (03): : 735 - 746