An Adaptive Region Proposal Network With Progressive Attention Propagation for Tiny Person Detection From UAV Images

被引:5
|
作者
Yu, Youjiang [1 ,2 ]
Zhang, Kaibing [1 ,2 ]
Wang, Xiaohua [2 ]
Wang, Nannan [3 ]
Gao, Xinbo [4 ]
机构
[1] Xian Polytech Univ, Shaanxi Key Lab Clothing Intelligence, Sch Comp Sci, Xian 710048, Peoples R China
[2] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
[3] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[4] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
Proposals; Feature extraction; Object detection; Detectors; Autonomous aerial vehicles; Visualization; Task analysis; Tiny object detection; region proposal network; attention mechanism; OBJECT DETECTION; PYRAMID NETWORK;
D O I
10.1109/TCSVT.2023.3335157
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Two-stage detectors, which consist of the multi-scale feature representations and the prediction of region proposal boxes, have been recognized as an effective paradigm for tiny object detection in Unmanned Aerial Vehicle (UAV) images. Although most previous methods primarily concentrated on developing efficient feature fusion strategies within the feature pyramid network (FPN), few studies elaborated on improving the performance of region proposal network (RPN). Conventional RPNs exhibit two key weaknesses in the majority of existing two-stage object detection approaches. Firstly, the quality of proposal boxes generated by the RPN is heavily reliant on rich feature representations extracted from the FPN backbone. Secondly, the fixed number of generated proposal boxes limits adaptability to the distribution of tiny person objects. To mitigate the aforementioned problems, in this paper we propose a novel adaptive region proposal network (ARPN) to improve the quality of the proposal boxes and generate particularly compact yet accurate proposal boxes. On one hand, a progressive attention mechanism is devised to make the ARPN focus more on prospective object regions, where a series of multi-scale front attention modules (FAM) are applied to coarsely filter out most of irrelevant background areas and a group of top-to-bottom back attention modules (BAM) aid the ARPN to finely pinpoint tiny objects of interest in a coarse-to-fine manner. On the other hand, a mini-density map, which is inspired by the philosophy of crowd counting, is elaborately designed to adaptively determine the number of region proposal boxes. This approach significantly reduces redundancy while maintaining high-quality proposal boxes. Extensive experiments verify the superiority of proposed ARPN and show obvious improvement over other competitors in terms of two performance indicators of average precision (AP) and average recall (AR). The code will be available at https://github.com/kbzhang0505/ARPN.
引用
收藏
页码:4392 / 4406
页数:15
相关论文
共 36 条
  • [31] Attention-Guided Multi-Branch Convolutional Neural Network for Mitosis Detection From Histopathological Images
    Lei, Haijun
    Liu, Shaomin
    Elazab, Ahmed
    Gong, Xuehao
    Lei, Baiying
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (02) : 358 - 370
  • [32] A New Deep Learning Network for Automatic Bridge Detection from SAR Images Based on Balanced and Attention Mechanism
    Chen, Lifu
    Weng, Ting
    Xing, Jin
    Pan, Zhouhao
    Yuan, Zhihui
    Xing, Xuemin
    Zhang, Peng
    REMOTE SENSING, 2020, 12 (03)
  • [33] Small Object Detection in UAV Remote Sensing Images Based on Intra-Group Multi-Scale Fusion Attention and Adaptive Weighted Feature Fusion Mechanism
    Yuan, Zhe
    Gong, Jianglei
    Guo, Baolong
    Wang, Chao
    Liao, Nannan
    Song, Jiawei
    Wu, Qiming
    REMOTE SENSING, 2024, 16 (22)
  • [34] ARAA-Net: Adaptive Region-Aware Attention Network for Epiphysis and Articular Surface Segmentation From Hand Radiographs
    Deng, Yamei
    Song, Ting
    Wang, Xu
    Liao, Yong
    Chen, Yonglu
    He, Qian
    Yao, Yun
    Huang, Jianwei
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 14
  • [35] DCA-DAFFNet: An End-to-End Network With Deformable Fusion Attention and Deep Adaptive Feature Fusion for Laryngeal Tumor Grading From Histopathology Images
    Luo, Jiayang
    Huang, Pan
    He, Peng
    Wei, Biao
    Guo, Xiaodong
    Xiao, Hualiang
    Sun, Yuchun
    Tian, Sukun
    Zhou, Mi
    Feng, Peng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72 : 1 - 15
  • [36] AMFNet: an attention-based multi-level feature fusion network for ground objects extraction from mining area's UAV-based RGB images and digital surface model
    Li, Jin
    Cai, Xiang
    Qi, Jiandong
    JOURNAL OF APPLIED REMOTE SENSING, 2021, 15 (03)