Object counting in remote sensing via selective spatial-frequency pyramid network

被引:4
作者
Chen, Jinyong [1 ]
Gao, Mingliang [1 ]
Guo, Xiangyu [1 ]
Zhai, Wenzhe [1 ]
Li, Qilei [2 ]
Jeon, Gwanggil [3 ]
机构
[1] Shandong Univ Technol, Sch Elect & Elect Engn, Zibo, Peoples R China
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England
[3] Incheon Natl Univ, Dept Embedded Syst Engn, Incheon, South Korea
关键词
attention mechanism; background clutter; edge computing; object counting; remote sensing; scale variation; SCALE;
D O I
10.1002/spe.3287
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The integration of remote sensing object counting in the Mobile Edge Computing (MEC) environment is of crucial significance and practical value. However, the presence of significant background interference in remote sensing images poses a challenge to accurate object counting, as the results are easily affected by background noise. Additionally, scale variation within remote sensing images presents a further difficulty, as traditional counting methods face challenges in adapting to objects of different scales. To address these challenges, we propose a selective spatial-frequency pyramid network (SSFPNet). Specifically, the SSFPNet consists of two core modules, namely the pyramid attention (PA) module and the hybrid feature pyramid (HFP) module. The PA module accurately extracts target regions and eliminates background interference by operating on four parallel branches. This enables more precise object counting. The HFP module is introduced to fuse spatial and frequency domain information, leveraging scale information from different domains for object counting, so as to improve the accuracy and robustness of counting. Experimental results on RSOC, CARPK, and PUCPR+ benchmark datasets demonstrate that the SSFPNet achieves state-of-the-art performance in terms of accuracy and robustness.
引用
收藏
页码:1754 / 1773
页数:20
相关论文
共 58 条
[1]   Proactive content caching in edge computing environment: A review [J].
Aghazadeh, Rafat ;
Shahidinejad, Ali ;
Ghobaei-Arani, Mostafa .
SOFTWARE-PRACTICE & EXPERIENCE, 2023, 53 (03) :811-855
[2]  
Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
[3]   Is the edge really necessary for drone computing offloading? An experimental assessment in carrier-grade 5G operator networks [J].
Candal-Ventureira, David ;
Javier Gonzalez-Castano, Francisco ;
Gil-Castineira, Felipe ;
Fondo-Ferreiro, Pablo .
SOFTWARE-PRACTICE & EXPERIENCE, 2023, 53 (03) :579-599
[4]   Scale Aggregation Network for Accurate and Efficient Crowd Counting [J].
Cao, Xinkun ;
Wang, Zhipeng ;
Zhao, Yanyun ;
Su, Fei .
COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 :757-773
[5]   Bayesian Poisson Regression for Crowd Counting [J].
Chan, Antoni B. ;
Vasconcelos, Nuno .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :545-551
[6]   Scale Pyramid Network for Crowd Counting [J].
Chen, Xinya ;
Bin, Yanrui ;
Sang, Nong ;
Gao, Changxin .
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1941-1950
[7]   Road extraction in remote sensing data: A survey [J].
Chen, Ziyi ;
Deng, Liai ;
Luo, Yuhua ;
Li, Dilong ;
Marcato Junior, Jose ;
Gonsalves, Wesley Nunes ;
Nurunnabi, Abdul Awal Md ;
Li, Jonathan ;
Wang, Cheng ;
Li, Deren .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 112
[8]   Object Counting and Instance Segmentation with Image-level Supervision [J].
Cholakkal, Hisham ;
Sun, Guolei ;
Khan, Fahad Shahbaz ;
Shao, Ling .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12389-12397
[9]   CentroidNetV2: A hybrid deep neural network for small-object segmentation and counting [J].
Dijkstra, Klaas ;
van de Loosdrecht, Jaap ;
Atsma, Waatze A. ;
Schomaker, Lambert R. B. ;
Wiering, Marco A. .
NEUROCOMPUTING, 2021, 423 :490-505
[10]   Pedestrian Detection: An Evaluation of the State of the Art [J].
Dollar, Piotr ;
Wojek, Christian ;
Schiele, Bernt ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) :743-761