Counting From Sky: A Large-Scale Data Set for Remote Sensing Object Counting and a Benchmark Method

被引:43
作者
Gao, Guangshuai [1 ,2 ]
Liu, Qingjie [1 ,2 ]
Wang, Yunhong [1 ,2 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2021年 / 59卷 / 05期
基金
中国国家自然科学基金;
关键词
Attention mechanism; deformable convolution layer; object counting; remote sensing; scale pyramid module (SPM);
D O I
10.1109/TGRS.2020.3020555
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Object counting, whose aim is to estimate the number of objects from a given image, is an important and challenging computation task. Significant efforts have been devoted to addressing this problem and achieved great progress, yet counting the number of ground objects from remote sensing images is barely studied. In this article, we are interested in counting dense objects from remote sensing images. Compared with object counting in a natural scene, this task is challenging in the following factors: large-scale variation, complex cluttered background, and orientation arbitrariness. More importantly, the scarcity of data severely limits the development of research in this field. To address these issues, we first construct a large-scale object counting data set with remote sensing images, which contains four important geographic objects: buildings, crowded ships in harbors, and large vehicles and small vehicles in parking lots. We then benchmark the data set by designing a novel neural network that can generate a density map of an input image. The proposed network consists of three parts, namely attention module, scale pyramid module, and deformable convolution module (DCM) to attack the aforementioned challenging factors. Extensive experiments are performed on the proposed data set and one crowd counting data set, which demonstrates the challenges of the proposed data set and the superiority and effectiveness of our method compared with state-of-the-art methods.
引用
收藏
页码:3642 / 3655
页数:14
相关论文
共 76 条
  • [1] Young and mature oil palm tree detection and counting using convolutional neural network deep learning method
    Abd Mubin, Nurulain
    Nadarajoo, Eiswary
    Shafri, Helmi Zulhaidi Mohd
    Hamedianfar, Alireza
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2019, 40 (19) : 7500 - 7515
  • [2] [Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.596
  • [3] Counting in the Wild
    Arteta, Carlos
    Lempitsky, Victor
    Zisserman, Andrew
    [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 483 - 498
  • [4] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [5] Bahmanyar R., 2019, BMVCW
  • [6] A Variational Level-Set Method for Unsupervised Change Detection in Remote Sensing Images
    Bazi, Yakoub
    Melgani, Farid
    [J]. 2009 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-5, 2009, : 1235 - +
  • [7] CrowdNet: A Deep Convolutional Network for Dense Crowd Counting
    Boominathan, Lokesh
    Kruthiventi, Srinivas S. S.
    Babu, R. Venkatesh
    [J]. MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 640 - 644
  • [8] Bradbury J., 2019, Advances in Neural Information Processing, VVolume 32, DOI DOI 10.48550/ARXIV.1912.01703
  • [9] Scale Aggregation Network for Accurate and Efficient Crowd Counting
    Cao, Xinkun
    Wang, Zhipeng
    Zhao, Yanyun
    Su, Fei
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 757 - 773
  • [10] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
    Chen, Long
    Zhang, Hanwang
    Xiao, Jun
    Nie, Liqiang
    Shao, Jian
    Liu, Wei
    Chua, Tat-Seng
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6298 - 6306