Towards unified on-road object detection and depth estimation from a single image

Citations: 0
Authors
Guofei Lian
Yan Wang
Huabiao Qin
Guancheng Chen
Affiliation
[1] South China University of Technology, School of Electronic and Information Engineering
Source
International Journal of Machine Learning and Cybernetics | 2022 / Vol. 13
Keywords
On-road object detection; Depth estimation; Monocular image; Convolution neural network; YOLOv3;
DOI
Not available
Abstract
On-road object detection based on convolutional neural networks (CNNs) is an important problem in autonomous driving. However, traditional 2D object detection performs object classification and localization in image space only and cannot recover depth information. Moreover, cascading an object detection network with a monocular depth estimation network to realize 2.5D object detection is inefficient. To address these problems, we propose a unified multi-task learning mechanism for object detection and depth estimation. First, we propose a new loss function, the projective consistency loss, which uses the perspective projection principle to model the relationship between target size and depth value, so that the object detection task and the depth estimation task constrain each other. Second, we propose a global multi-scale feature extraction scheme that combines the Global Context (GC) block and the Atrous Spatial Pyramid Pooling (ASPP) block, which promotes effective feature learning and collaborative learning between object detection and depth estimation. Comprehensive experiments on the KITTI and Cityscapes datasets show that our approach achieves high mAP and low distance estimation error, outperforming other state-of-the-art methods.
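The abstract does not state the exact form of the projective consistency loss. As a minimal sketch of the underlying idea only, the Python below assumes a pinhole camera in which an object of real height H at depth Z spans h = f * H / Z pixels, and penalizes the gap between a predicted depth and the depth implied by a predicted box height. The function names, the use of height as the "target size", the known real-world height, and the absolute-difference penalty are all illustrative assumptions, not the authors' formulation.

```python
def expected_depth(focal_px, real_height_m, box_height_px):
    """Depth implied by the pinhole model: Z = f * H / h.

    All three arguments are hypothetical inputs for illustration:
    focal length in pixels, assumed real object height in meters,
    and predicted bounding-box height in pixels.
    """
    if box_height_px <= 0:
        raise ValueError("box height must be positive")
    return focal_px * real_height_m / box_height_px


def projective_consistency_loss(pred_depth_m, focal_px, real_height_m, box_height_px):
    """Absolute gap between the predicted depth and the box-implied depth.

    This is an assumed penalty form; the paper may use a different one.
    """
    implied = expected_depth(focal_px, real_height_m, box_height_px)
    return abs(pred_depth_m - implied)


if __name__ == "__main__":
    # Hypothetical numbers: f = 720 px, a 1.5 m tall car, a 54 px tall box.
    print(expected_depth(720.0, 1.5, 54.0))
    print(projective_consistency_loss(22.0, 720.0, 1.5, 54.0))
```

Under this assumed form, a depth prediction that disagrees with the detected box size incurs a penalty, which is one way the two tasks could constrain each other during joint training.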
Pages: 1231-1241
Number of pages: 10