Semantic Information Feature Aggregation Network for Object Detection in Remote Sensing Images

被引：0

作者：

Guo, Zhe ^{[1
,2
]}

Bi, Guoling ^{[2
]}

Lv, Hengyi ^{[2
]}

Zhao, Yuchen ^{[2
]}

Han, Lintao ^{[1
,2
]}

机构：

[1] Univ Chinese Acad Sci, Sch Optoelect, Beijing 100049, Peoples R China

[2] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China

来源：

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS | 2024年 / 21卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Remote sensing; Semantics; Convolution; Object detection; Task analysis; Head; Adaptive feature extraction module (AFEM); object detection; remote sensing scene; semantic information; tridirectional feature fusion module (TRFFM);

D O I：

10.1109/LGRS.2024.3406345

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Object detection is a crucial but challenging task in remote sensing images. Thanks to the emergence of convolutional neural networks (CNNs), object detection has made significant progress. However, there are still two significant issues that must be addressed: 1) since small targets are distributed at any angle, the features extracted by traditional convolution are incomplete and 2) the objects in remote sensing images are small and dense, resulting in missed detections and false detections during the detection process. In this letter, we innovatively propose to obtain more semantic information to help remote sensing detection tasks solve these two problems. To achieve this goal, we design two novel modules: an adaptive feature extraction module (AFEM) and a tridirectional feature fusion module (TRFFM) to improve detection capabilities in small target-dense scenarios. More specifically, AFEM combines local features with global features to adaptively fit the receptive field of rotating targets. TRFFM establishes multiple paths between different layers of the feature pyramid and uses a weighted special fusion mechanism to obtain higher quality feature maps. Extensive experiments on two challenging remote datasets, optical remote sensing images (DIOR) and VisDrone2019, results reached 75.4% AP50 and 33.2% AP, respectively, which verified the superiority of our method in terms of accuracy and adaptability. The code has been open-sourced at https://github.com/GGD777/SIFANet.

引用

页码：1 / 5

页数：5

共 28 条

[1]

[Anonymous], 2020, ZENODO, DOI DOI 10.5281/ZENODO.3958273

[2] A GNSS/LiDAR/IMU Pose Estimation System Based on Collaborative Fusion of Factor Map and Filtering [J].

Chen, Honglin ;

Wu, Wei ;

Zhang, Si ;

Wu, Chaohong ;

Zhong, Ruofei .

REMOTE SENSING, 2023, 15 (03)

[3] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

[4] VisDrone-SOT2019: The Vision Meets Drone Single Object Tracking Challenge Results [J].

Du, Dawei ;

Zhu, Pengfei ;

Wen, Longyin ;

Bian, Xiao ;

Ling, Haibin ;

Hu, Qinghua ;

Zheng, Jiayu ;

Peng, Tao ;

Wang, Xinyao ;

Zhang, Yue ;

Bo, Liefeng ;

Shi, Hailin ;

Zhu, Rui ;

Han, Bo ;

Zhang, Chunhui ;

Liu, Guizhong ;

Wu, Han ;

Wen, Hao ;

Wang, Haoran ;

Fan, Jiaqing ;

Chen, Jie ;

Gao, Jie ;

Zhang, Jie ;

Zhou, Jinghao ;

Zhou, Jinliu ;

Wang, Jinwang ;

Wan, Jiuqing ;

Kittler, Josef ;

Zhang, Kaihua ;

Huang, Kaiqi ;

Yang, Kang ;

Zhang, Kangkai ;

Huang, Lianghua ;

Zhou, Lijun ;

Shi, Lingling ;

Ding, Lu ;

Wang, Ning ;

Wang, Peng ;

Hu, Qintao ;

Laganiere, Robert ;

Ma, Ruiyan ;

Zhang, Ruohan ;

Zou, Shanrong ;

Zhao, Shengwei ;

Li, Shengyang ;

Zhu, Shengyin ;

Li, Shikun ;

Ge, Shiming ;

Xuan, Shiyu ;

Xu, Tianyang .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :199-212

[5] Instances as Queries [J].

Fang, Yuxin ;

Yang, Shusheng ;

Wang, Xinggang ;

Li, Yu ;

Fang, Chen ;

Shan, Ying ;

Feng, Bin ;

Liu, Wenyu .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :6890-6899

[6]

Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, DOI 10.48550/ARXIV.2107.08430]

[7] NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection [J].

Ghiasi, Golnaz ;

Lin, Tsung-Yi ;

Le, Quoc V. .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7029-7038

[8]

He Kaiming., P IEEE INT C COMPUTE, P2961, DOI DOI 10.48550/ARXIV.1703.06870

[9] Object detection in optical remote sensing images: A survey and a new benchmark [J].

Li, Ke ;

Wan, Gang ;

Cheng, Gong ;

Meng, Liqiu ;

Han, Junwei .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 159 :296-307

[10] Cross-Layer Attention Network for Small Object Detection in Remote Sensing Imagery [J].

Li, Yangyang ;

Huang, Qin ;

Pei, Xuan ;

Chen, Yanqiao ;

Jiao, Licheng ;

Shang, Ronghua .

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 :2148-2161

← 1 2 3 →