Simultaneously Detecting and Counting Dense Vehicles From Drone Images

被引:76
作者
Li, Wei [1 ]
Li, Hongliang [1 ]
Wu, Qingbo [1 ]
Chen, Xiaoyu [1 ]
Ngan, King Ngi [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural networks (CNNs); deep learning; intelligent vehicles; object detection; object recognition; unmanned aerial vehicles; vehicle detection; MUTUAL INFORMATION; FEATURES;
D O I
10.1109/TIE.2019.2899548
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned aerial vehicles are an essential component in the realization of Industry 4.0. With drones helping to improve industrial safety and efficiency in utilities, construction, and communication, there is an urgent need for drone-based intelligent applications. In this paper, we develop a unified framework to simultaneously detect and count vehicles from drone images. We first explore why the state-of-the-art detectors fail in highly dense drone scenes, which provides more appropriate insights. Then, we propose an effective loss to push the anchors toward matching the ground-truth boxes asmuch as possible, specifically designed for scale-adaptive anchor generation. Inspired by attention mechanisms in the human visual system, we maximize the mutual information between object classes and features by combining bottom-up cues with top-down attention mechanisms specifically designed for feature extraction. Finally, we build a counting layer with a regularized constraint related to the number of vehicles. Extensive experiments demonstrate the effectiveness of our approach. For both tasks, our proposed method achieves state-of-the-art results on all four challenging datasets. In particular, our results reduce error by a larger factor than previous methods.
引用
收藏
页码:9651 / 9662
页数:12
相关论文
共 48 条
[1]   Fast Classification of Empty and Occupied Parking Spaces Using Integral Channel Features [J].
Ahrnbom, Martin ;
Astrom, Kalle ;
Nilsson, Mikael .
PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, :1609-1615
[2]  
[Anonymous], 2018, COMPUTER VISION PATT
[3]  
[Anonymous], 2017, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2017.322
[4]  
[Anonymous], 2014, P INT C LEARN REPR W
[5]  
[Anonymous], PROC CVPR IEEE
[6]  
[Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031
[7]   Feature selection using Joint Mutual Information Maximisation [J].
Bennasar, Mohamed ;
Hicks, Yulia ;
Setchi, Rossitza .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) :8520-8532
[8]   CrowdNet: A Deep Convolutional Network for Dense Crowd Counting [J].
Boominathan, Lokesh ;
Kruthiventi, Srinivas S. S. ;
Babu, R. Venkatesh .
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, :640-644
[9]  
Borji A, 2012, PROC CVPR IEEE, P438, DOI 10.1109/CVPR.2012.6247706
[10]   Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks [J].
Cao, Chunshui ;
Liu, Xianming ;
Yang, Yi ;
Yu, Yinan ;
Wang, Jiang ;
Wang, Zilei ;
Huang, Yongzhen ;
Wang, Liang ;
Huang, Chang ;
Xu, Wei ;
Ramanan, Deva ;
Huang, Thomas S. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2956-2964