VCANet: Vanishing-Point-Guided Context-Aware Network for Small Road Object Detection

被引:30
作者
Chen, Guang [1 ,2 ]
Chen, Kai [1 ]
Zhang, Lijun [1 ]
Zhang, Liming [3 ]
Knoll, Alois [2 ]
机构
[1] Tongji Univ, Shanghai, Peoples R China
[2] Tech Univ Munich, Munich, Germany
[3] Geely Res Inst, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous driving; Road hazard; Object detection; Deep learning; Vanishing point; VISION;
D O I
10.1007/s42154-021-00157-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Advanced deep learning technology has made great progress in generic object detection of autonomous driving, yet it is still challenging to detect small road hazards in a long distance owing to lack of large-scale small-object datasets and dedicated methods. This work addresses the challenge from two aspects. Firstly, a self-collected long-distance road object dataset (TJ-LDRO) is introduced, which consists of 109,337 images and is the largest dataset so far for the small road object detection research. Secondly, a vanishing-point-guided context-aware network (VCANet) is proposed, which utilizes the vanishing point prediction block and the context-aware center detection block to obtain semantic information. The multi-scale feature fusion pipeline and the upsampling block in VCANet are introduced to enhance the region of interest (ROI) feature. The experimental results with TJ-LDRO dataset show that the proposed method achieves better performance than the representative generic object detection methods. This work fills a critical capability gap in small road hazards detection for high-speed autonomous vehicles.
引用
收藏
页码:400 / 412
页数:13
相关论文
共 63 条
[1]  
[Anonymous], 2017, INT C COMP VIS
[2]  
[Anonymous], 2015, Fast rCnn,, DOI DOI 10.48550/ARXIV.1504.08083
[3]  
[Anonymous], 2016, ARXIV160906694
[4]   SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network [J].
Bai, Yancheng ;
Zhang, Yongqiang ;
Ding, Mingli ;
Ghanem, Bernard .
COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :210-226
[5]   Visual objects in context [J].
Bar, M .
NATURE REVIEWS NEUROSCIENCE, 2004, 5 (08) :617-629
[6]   Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [J].
Bell, Sean ;
Zitnick, C. Lawrence ;
Bala, Kavita ;
Girshick, Ross .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2874-2883
[7]  
Berg A.C, 2017, ARXIV PREPRINT ARXIV
[8]   PERCEIVING REAL-WORLD SCENES [J].
BIEDERMA.I .
SCIENCE, 1972, 177 (4043) :77-&
[9]  
Bosquet B., 2018, P BMVC
[10]   Feature-Fused SSD: Fast Detection for Small Objects [J].
Cao, Guimei ;
Xie, Xuemei ;
Yang, Wenzhe ;
Liao, Quan ;
Shi, Guangming ;
Wu, Jinjian .
NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615