共 36 条
An Adaptive Region Proposal Network With Progressive Attention Propagation for Tiny Person Detection From UAV Images
被引:5
|作者:
Yu, Youjiang
[1
,2
]
Zhang, Kaibing
[1
,2
]
Wang, Xiaohua
[2
]
Wang, Nannan
[3
]
Gao, Xinbo
[4
]
机构:
[1] Xian Polytech Univ, Shaanxi Key Lab Clothing Intelligence, Sch Comp Sci, Xian 710048, Peoples R China
[2] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
[3] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[4] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
基金:
中国国家自然科学基金;
关键词:
Proposals;
Feature extraction;
Object detection;
Detectors;
Autonomous aerial vehicles;
Visualization;
Task analysis;
Tiny object detection;
region proposal network;
attention mechanism;
OBJECT DETECTION;
PYRAMID NETWORK;
D O I:
10.1109/TCSVT.2023.3335157
中图分类号:
TM [电工技术];
TN [电子技术、通信技术];
学科分类号:
0808 ;
0809 ;
摘要:
Two-stage detectors, which consist of the multi-scale feature representations and the prediction of region proposal boxes, have been recognized as an effective paradigm for tiny object detection in Unmanned Aerial Vehicle (UAV) images. Although most previous methods primarily concentrated on developing efficient feature fusion strategies within the feature pyramid network (FPN), few studies elaborated on improving the performance of region proposal network (RPN). Conventional RPNs exhibit two key weaknesses in the majority of existing two-stage object detection approaches. Firstly, the quality of proposal boxes generated by the RPN is heavily reliant on rich feature representations extracted from the FPN backbone. Secondly, the fixed number of generated proposal boxes limits adaptability to the distribution of tiny person objects. To mitigate the aforementioned problems, in this paper we propose a novel adaptive region proposal network (ARPN) to improve the quality of the proposal boxes and generate particularly compact yet accurate proposal boxes. On one hand, a progressive attention mechanism is devised to make the ARPN focus more on prospective object regions, where a series of multi-scale front attention modules (FAM) are applied to coarsely filter out most of irrelevant background areas and a group of top-to-bottom back attention modules (BAM) aid the ARPN to finely pinpoint tiny objects of interest in a coarse-to-fine manner. On the other hand, a mini-density map, which is inspired by the philosophy of crowd counting, is elaborately designed to adaptively determine the number of region proposal boxes. This approach significantly reduces redundancy while maintaining high-quality proposal boxes. Extensive experiments verify the superiority of proposed ARPN and show obvious improvement over other competitors in terms of two performance indicators of average precision (AP) and average recall (AR). The code will be available at https://github.com/kbzhang0505/ARPN.
引用
收藏
页码:4392 / 4406
页数:15
相关论文