Integrally Mixing Pyramid Representations for Anchor-Free Object Detection in Aerial Imagery

被引:8
作者
Zhang, Cong [1 ]
Xiao, Jun [1 ]
Yang, Cuixin [1 ]
Zhou, Jingchun [2 ]
Lam, Kin-Man [1 ]
Wang, Qi [3 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Elect Engn, Hong Kong, Peoples R China
[2] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian 116026, Peoples R China
[3] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
关键词
Feature extraction; Detectors; Routing; Head; Object detection; Logic gates; Convolutional neural networks; Adaptive detection head; aerial images; anchor-free object detection; deep learning; pyramid representations;
D O I
10.1109/LGRS.2024.3404481
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Anchor-free object detectors have recently received increasing research attention in the field of aerial scene object detection, due to their high flexibility and practicality. Anchor-free detectors typically depend on the feature pyramid network (FPN) to alleviate the challenge of significant variations in object scales in aerial contexts. Despite establishing a multiscale feature pyramid, existing FPN-based methods treat each aerial object as an indivisible entity solely managed by a single-scale representation. However, they fail to take into account the distinct characteristics of various components within an instance. To this end, this letter proposes a novel anchor-free detector, namely IMPR-Det, which can integrally mix multiscale pyramid representations for different components of an instance, thus boosting the fine-grained object representation capability. Specifically, IMPR-Det fundamentally introduces a more advanced detection head with an adaptive routing mechanism for pixel-level multiscale feature assignment, instead of previous instance-level assignment. Experimental results demonstrate the superiority of the proposed method over its counterparts, in terms of both accuracy and efficiency, for object detection in aerial images.
引用
收藏
页码:1 / 5
页数:5
相关论文
共 28 条
[1]   Multi-Scale Spatial and Channel-wise Attention for Improving Object Detection in Remote Sensing Imagery [J].
Chen, Jie ;
Wan, Li ;
Zhu, Jingru ;
Xu, Gang ;
Deng, Min .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (04) :681-685
[2]   Cross-Scale Feature Fusion for Object Detection in Optical Remote Sensing Images [J].
Cheng, Gong ;
Si, Yongjie ;
Hong, Hailong ;
Yao, Xiwen ;
Guo, Lei .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (03) :431-435
[3]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[4]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[5]   An Anchor-Free Method Based on Feature Balancing and Refinement Network for Multiscale Ship Detection in SAR Images [J].
Fu, Jiamei ;
Sun, Xian ;
Wang, Zhirui ;
Fu, Kun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (02) :1331-1344
[6]   An Anchor-Free Network With Density Map and Attention Mechanism for Multiscale Object Detection in Aerial Images [J].
Guo, Yiyou ;
Tong, Xiaohua ;
Xu, Xiong ;
Liu, Sicong ;
Feng, Yongjiu ;
Xie, Huan .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[7]   Improving Training Instance Quality in Aerial Image Object Detection With a Sampling-Balance-Based Multistage Network [J].
Han, Wei ;
Fan, Runyu ;
Wang, Lizhe ;
Feng, Ruyi ;
Li, Fengpeng ;
Deng, Ze ;
Chen, Xiaodao .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (12) :10575-10589
[8]   CornerNet: Detecting Objects as Paired Keypoints [J].
Law, Hei ;
Deng, Jia .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :765-781
[9]   Object detection in optical remote sensing images: A survey and a new benchmark [J].
Li, Ke ;
Wan, Gang ;
Cheng, Gong ;
Meng, Liqiu ;
Han, Junwei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 159 (159) :296-307
[10]   MEDNet: Multiexpert Detection Network With Unsupervised Clustering of Training Samples [J].
Lin, Qifeng ;
Zhao, Jianhui ;
Du, Bo ;
Fu, Gang ;
Yuan, Zhiyong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60