Object counting in remote sensing via selective spatial-frequency pyramid network

被引：5

作者：

Chen, Jinyong ^{[1
]}

Gao, Mingliang ^{[1
]}

Guo, Xiangyu ^{[1
]}

Zhai, Wenzhe ^{[1
]}

Li, Qilei ^{[2
]}

Jeon, Gwanggil ^{[3
]}

机构：

[1] Shandong Univ Technol, Sch Elect & Elect Engn, Zibo, Peoples R China

[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England

[3] Incheon Natl Univ, Dept Embedded Syst Engn, Incheon, South Korea

来源：

SOFTWARE-PRACTICE & EXPERIENCE | 2024年 / 54卷 / 09期

关键词：

attention mechanism; background clutter; edge computing; object counting; remote sensing; scale variation; SCALE;

D O I：

10.1002/spe.3287

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The integration of remote sensing object counting in the Mobile Edge Computing (MEC) environment is of crucial significance and practical value. However, the presence of significant background interference in remote sensing images poses a challenge to accurate object counting, as the results are easily affected by background noise. Additionally, scale variation within remote sensing images presents a further difficulty, as traditional counting methods face challenges in adapting to objects of different scales. To address these challenges, we propose a selective spatial-frequency pyramid network (SSFPNet). Specifically, the SSFPNet consists of two core modules, namely the pyramid attention (PA) module and the hybrid feature pyramid (HFP) module. The PA module accurately extracts target regions and eliminates background interference by operating on four parallel branches. This enables more precise object counting. The HFP module is introduced to fuse spatial and frequency domain information, leveraging scale information from different domains for object counting, so as to improve the accuracy and robustness of counting. Experimental results on RSOC, CARPK, and PUCPR+ benchmark datasets demonstrate that the SSFPNet achieves state-of-the-art performance in terms of accuracy and robustness.

引用

页码：1754 / 1773

页数：20

共 58 条

[31]

Lian Z., SOFTW PRACT EXP

[32] SSD: Single Shot MultiBox Detector [J].

Liu, Wei ;

Anguelov, Dragomir ;

Erhan, Dumitru ;

Szegedy, Christian ;

Reed, Scott ;

Fu, Cheng-Yang ;

Berg, Alexander C. .

COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37

[33] Context-Aware Crowd Counting [J].

Liu, Weizhe ;

Salzmann, Mathieu ;

Fua, Pascal .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5094-5103

[34] A deep learning approach to the screening of malaria infection: Automated and rapid cell counting, object detection and instance segmentation using Mask R-CNN [J].

Loh, De Rong ;

Wen Xin Yong ;

Yapeter, Jullian ;

Subburaj, Karupppasamy ;

Chandramohanadas, Rajesh .

COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2021, 88

[35] Bayesian Loss for Crowd Count Estimation with Point Supervision [J].

Ma, Zhiheng ;

Wei, Xing ;

Hong, Xiaopeng ;

Gong, Yihong .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6141-6150

[36]

Mao YJ, 2017, INT GEOL REV, V59, P1276, DOI [10.1109/COMST.2017.2745201, 10.1080/00206814.2016.1209435]

[37]

Mundhenk TN., 2016, P EUROPEAN C COMPUTE

[38] FcaNet: Frequency Channel Attention Networks [J].

Qin, Zequn ;

Zhang, Pengyi ;

Wu, Fei ;

Li, Xi .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :763-772

[39] Urban planning and building smart cities based on the Internet of Things using Big Data analytics [J].

Rathore, M. Mazhar ;

Ahmad, Awais ;

Paul, Anand ;

Rho, Seungmin .

COMPUTER NETWORKS, 2016, 101 :63-80

[40] You Only Look Once: Unified, Real-Time Object Detection [J].

Redmon, Joseph ;

Divvala, Santosh ;

Girshick, Ross ;

Farhadi, Ali .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788

← 1 2 3 4 5 6 →