Tiny Vehicle Detection for Mid-to-High Altitude UAV Images Based on Visual Attention and Spatial-Temporal Information

被引:11
作者
Yu, Ruonan [1 ]
Li, Hongguang [2 ]
Jiang, Yalong [2 ]
Zhang, Baochang [3 ]
Wang, Yufeng [2 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Unmanned Syst Res Inst, Beijing 100191, Peoples R China
[3] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
mid-to-high altitude UAV images; tiny object detection; visual attention; spatial-temporal information; SMALL TARGET; DIM;
D O I
10.3390/s22062354
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Mid-to-high altitude Unmanned Aerial Vehicle (UAV) imagery can provide important remote sensing information between satellite and low altitude platforms, and vehicle detection in mid-to-high altitude UAV images plays a crucial role in land monitoring and disaster relief. However, the high background complexity of images and limited pixels of objects challenge the performance of tiny vehicle detection. Traditional methods suffer from poor adaptation ability to complex backgrounds, while deep neural networks (DNNs) have inherent defects in feature extraction of tiny objects with finite pixels. To address the issue above, this paper puts forward a vehicle detection method combining the DNNs-based and traditional methods for mid-to-high altitude UAV images. We first employ the deep segmentation network to exploit the co-occurrence of the road and vehicles, then detect tiny vehicles based on visual attention mechanism with spatial-temporal constraint information. Experimental results show that the proposed method achieves effective detection of tiny vehicles in complex backgrounds. In addition, ablation experiments are performed to inspect the effectiveness of each component, and comparative experiments on tinier objects are carried out to prove the superior generalization performance of our method in detecting vehicles with a limited size of 5 x 5 pixels or less.
引用
收藏
页数:18
相关论文
共 52 条
  • [1] Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
  • [2] Enhancement of dim small target through modified top-hat transformation under the condition of heavy clutter
    Bai, Xiangzhi
    Zhou, Fugen
    Jin, Ting
    [J]. SIGNAL PROCESSING, 2010, 90 (05) : 1643 - 1654
  • [3] Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection
    Barekatain, Mohammadamin
    Marti, Miquel
    Shih, Hsueh-Fu
    Murray, Samuel
    Nakayama, Kotaro
    Matsuo, Yutaka
    Prendinger, Helmut
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 2153 - 2160
  • [4] Dim-Small Target Detection Based on Adaptive Pipeline Filtering
    Biao Li
    Xu Zhiyong
    Zhang, Jianlin
    Wang, Xiangru
    Fan, Xiangsuo
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [5] Bochkovskiy A., 2020, ARXIV 200410934
  • [6] R-CNN for Small Object Detection
    Chen, Chenyi
    Liu, Ming-Yu
    Tuzel, Oncel
    Xiao, Jianxiong
    [J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 214 - 230
  • [7] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [8] Chen Y, 2016, INT SYM COMPUT INTEL, P467, DOI [10.1109/ISCID.2016.220, 10.1109/ISCID.2016.2115]
  • [9] An Efficient Infrared Small Target Detection Method Based on Visual Contrast Mechanism
    Chen, Yuwen
    Xin, Yunhong
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (07) : 962 - 966
  • [10] Dai X., 2021, P IEEE CVF C COMP VI