Robust Vehicle Detection in High-Resolution Aerial Images With Imbalanced Data

被引:0
作者
Li X. [1 ]
Li X. [1 ]
Li Z. [3 ]
Xiong X. [4 ]
Khyam M.O. [5 ]
Sun C. [1 ]
机构
[1] The Laboratory of Measurement, Control of CSE, Ministry of Education, School of Automation, Southeast University, Nanjing
[2] School of Cyber Science, and Engineering, Southeast University, Nanjing
[3] The Department of Automation, University of Science, Technology of China, Hefei
[4] The Department of Faculty of Science, McGill University, Montréal, H3A 0G4, QC
[5] The School of Engineering and Technology, Central Queensland University, Melbourne, 3000, VIC
来源
IEEE Transactions on Artificial Intelligence | 2021年 / 2卷 / 03期
基金
中国国家自然科学基金;
关键词
Bag-based single-stage detector (BSSD); class imbalance problem; data augmentation; vehicle detection;
D O I
10.1109/tai.2021.3081057
中图分类号
学科分类号
摘要
Vehicle detection in images from unmanned aerial vehicles (UAVs) plays an important role in traffic surveillance and urban planning due to the popularity of UAVs. However, the class imbalance problem is an important factor that restricts the performance of vehicle detectors. There are two types of class imbalance in UAV images, i.e., foreground-background imbalance and foreground-foreground imbalance. For anchor-based single stage detector, as many ground truths cannot be assigned to corresponding anchors because of low intersection over union, it makes the foreground-background imbalance problem more severe. Therefore, we propose a novel bag-based single-stage detector, which treats each position on the feature map as a bag. A simple and adaptive definition of bags is proposed along with the positive sample definition method, which is utilized to ensure more ground truths can be assigned to proper bags. In addition, we utilize online hard example mining method to control the proportion of positive and negative samples during the training process. To address the foreground-foreground imbalance, we propose a novel data augmentation algorithm, which allows us to create appropriate visual context for under-represented class. Extensive experiments demonstrate the superiority of the proposed algorithm, compared with other state-of-the-art solutions. Impact Statement-Recently, unmanned aerial vehicles (UAVs) are widely used in intelligent transportation due to their low price and high flexibility, which makes vehicle detection in UAV images important for automatically gathering of traffic information. However, the class imbalance problem, which is common in object detection where some classes have far fewer frequencies in the dataset, has an adverse effect on the performance of vehicle detectors. The data augmentation method and deep learning based vehicle detector proposed in this article are able to reduce the negative impact and improve detection performance by at least 1.27% in mean average precision index. In addition, compared with algorithms with similar detection performance, our method is at least 15 ms faster. The proposed method can benefit users in a wide variety of applications including UAV transportation, traffic surveillance, and urban planning. © 2021 IEEE.
引用
收藏
页码:238 / 250
页数:12
相关论文
共 48 条
[1]  
Tao C., Mi L., Li Y., Qi J., Xiao Y., Zhang J., Scene context-driven vehicle detection in high-resolution aerial images, IEEE Trans. Geosci. Remote Sens., 57, 10, pp. 7339-7351, (2019)
[2]  
Tang J., Liu F., Zou Y., Zhang W., Wang Y., An improved fuzzy neural network for traffic speed prediction considering periodic characteristic, IEEE Trans. Intell. Transp. Syst., 18, 9, pp. 2340-2350, (2017)
[3]  
Cheng G., Zhou P., Han J., Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sens. images, IEEE Trans. Geosci. Remote Sens., 54, 12, pp. 7405-7415, (2016)
[4]  
Cao X., Wu C., Lan J., Yan P., Li X., Vehicle detection and motion analysis in low-altitude airborne video under urban environment, IEEE Trans. Circuits Syst. Video Technol., 21, 10, pp. 1522-1533, (2011)
[5]  
Minaeian S., Liu J., Son Y., Effective and efficient detection of moving targets from a UAV's camera, IEEE Trans. Intell. Transp. Syst., 19, 2, pp. 497-506, (2018)
[6]  
Cheng G., Han J., Zhou P., Xu D., Learning rotation-invariant and fisher discriminative convolutional neural networks for object detection, IEEE Trans. Image Process., 28, 1, pp. 265-278, (2019)
[7]  
Ke R., Li Z., Tang J., Pan Z., Wang Y., Real-time traffic flow parameter estimation from UAV video based on ensemble classifier and optical flow, IEEE Trans. Intell. Transp. Syst., 20, 1, pp. 54-64, (2019)
[8]  
Zhou H., Kong H., Wei L., Creighton D., Nahavandi S., On detecting road regions in a single UAV image, IEEE Trans. Intell. Trans. Syst., 18, 7, pp. 1713-1722, (2017)
[9]  
Oksuz K., Cam B.C., Kalkan S., Akbas E., Imbalance problems in object detection: A review, IEEE Trans. Pattern Anal. Mach. Intell.
[10]  
Ouyang W., Wang X., Zhang C., Yang X., Factors in finetuning deep model for object detection with long-tail distribution, Proc. IEEE Conf. Comput. Vis. Pattern Recognit., pp. 864-873, (2016)