Star-Convolution for Image-Based 3D Object Detection

被引：0

作者：

Liu, Yuxuan ^{[1
]}

Xu, Zhenhua ^{[1
]}

Liu, Ming ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022) | 2022年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/ICRA46639.2022.9811612

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

3D object detection with only image inputs is an interesting and important problem in computer vision and autonomous driving. Nowadays, most existing monocular 3D object detection algorithms rely solely on the approximation power of convolutional neural networks to learn a mapping from pixels to 3D predictions without knowing the projection matrix of the camera. To endow the networks with camera projection knowledge, we propose the Star-Convolution module for application to image-based 3D detection. The introduced module increases the receptive field of the detector and embeds the camera's projection geometry inside the network while keeping the network end-to-end trainable. We test the module with different baselines in both monocular and stereo 3D object detection, and we achieve significant improvements on both tasks. The code will be published at https://github.com/Owen-Liuyuxuan/visualDet3D.

引用

页码：5018 / 5024

页数：7

共 50 条

[1] Sequential Image-based 3D Object Detection with Location Refinement
Sim, Sangmin
Kim, Youngseok
Kum, Dongsuk
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3625 - 3631
[2] Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection
Ma, Xinzhu
Wang, Yongtao
Zhang, Yinmin
Xia, Zhiyi
Meng, Yuan
Wang, Zhihui
Li, Haojie
Ouyang, Wanli
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6402 - 6412
[3] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
Qian, Rui
Garg, Divyansh
Wang, Yan
You, Yurong
Belongie, Serge
Hariharan, Bharath
Campbell, Mark
Weinberger, Kilian Q.
Chao, Wei-Lun
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5880 - 5889
[4] MonoDCN: Monocular 3D object detection based on dynamic convolution
Qu, Shenming
Yang, Xinyu
Gao, Yiming
Liang, Shengbin
PLOS ONE, 2022, 17 (10):
[5] DROP SPARSE CONVOLUTION FOR 3D OBJECT DETECTION
Zhu, Taohong
Shen, Jun
Wang, Chali
Xiong, Huiyuan
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3185 - 3189
[6] To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels
Chai, Yuning
Sun, Pei
Ngiam, Jiquan
Wang, Weiyue
Caine, Benjamin
Vasudevan, Vijay
Zhang, Xiao
Anguelov, Dragomir
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15995 - 16004
[7] 3D image-based stereology
Adachi, Yoshitaka
Ojima, Mayumi
Sato, Naoko
Wang, Yuan Tsung
THERMEC 2011, PTS 1-4, 2012, 706-709 : 2687 - +
[8] Image-based extraction of material reflectance properties of a 3D rigid object
Erdem, ME
Erdem, IA
Yilmaz, UG
Atalay, V
PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 245 - 248
[9] An Image-based System for Sharing a 3D Object by Transmitting to Remote Locations
Kikukawa, Tetsuya
Kitamura, Yoshifumi
Ohno, Tsubasa
Sakurai, Satoshi
Yamaguchi, Tokuo
Kishino, Fumio
Kunita, Yutaka
Isogai, Megumi
Kimata, Hideaki
Matsuura, Norihiko
IEEE VIRTUAL REALITY 2010, PROCEEDINGS, 2010, : 277 - +
[10] IoT-based 3D convolution for video salient object detection
Dong, Shizhou
Gao, Zhifan
Pirbhulal, Sandeep
Bian, Gui-Bin
Zhang, Heye
Wu, Wanqing
Li, Shuo
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (03): : 735 - 746

← 1 2 3 4 5 →