High-Resolution Feature Pyramid Network for Small Object Detection on Drone View

Cited by: 38

Authors:

Chen, Zhaodong [1,2]

Ji, Hongbing [1,2]

Zhang, Yongquan [1,2]

Zhu, Zhigang [1,2]

Li, Yifan [1,2]

Affiliations:

[1] Xidian Univ, Xian Key Lab Intelligent Spectrum Sensing & Inform, Xian 710071, Peoples R China

[2] Xidian Univ, Shaanxi Union Res Ctr Univ & Enterprise Intelligen, Xian 710071, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024 / Vol. 34 / Issue 01

Funding:

National Natural Science Foundation of China;

Keywords:

Object detection on drone view; small object detector; high-resolution feature; multiple-in-single-out feature pyramid network; CONTEXT;

DOI:

10.1109/TCSVT.2023.3286896

Chinese Library Classification (CLC):

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Object detection has developed rapidly in recent years with the help of deep learning. However, object detection on drone view remains challenging for two main reasons: (1) small-scale objects lack detailed information and are therefore difficult to detect; (2) the diversity of drone camera angles produces dramatic differences in object scale. Although the feature pyramid network (FPN) alleviates the problem caused by scale differences to some extent, it also retains some worthless features, which wastes resources and reduces speed. In this work, we propose a novel High-Resolution Feature Pyramid Network (HR-FPN) to improve the detection accuracy of small-scale objects and avoid feature redundancy. The key components of HR-FPN are a high-resolution feature alignment module (HRFA), a high-resolution feature fusion module (HRFF), and a multi-scale decoupled head (MSDH). HRFA feeds multi-scale features from the backbone into parallel resampling channels to obtain high-resolution features at the same scale. HRFF establishes a bottom-up path to distribute context-rich low-level semantic information to all layers, which are then aggregated into a classification feature and a localization feature. MSDH copes with the scale differences of objects by separately predicting the categories and locations of objects at different scales. Moreover, we train the model with a scale-weighted loss so that it focuses more on small-scale objects. Extensive experiments and comprehensive evaluations demonstrate the effectiveness and advancement of our approach.
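The alignment idea in the abstract, bringing multi-scale backbone features to one common high resolution, can be illustrated with a minimal sketch. This is not the authors' HRFA module: the abstract does not specify the resampling operator, so nearest-neighbor upsampling is assumed here, and the names `upsample_nearest` and `align_to_highest_resolution`, the single-channel square maps, and the toy values are all hypothetical.

```python
from typing import List

FeatureMap = List[List[float]]  # one single-channel H x W map (assumed square)


def upsample_nearest(fm: FeatureMap, factor: int) -> FeatureMap:
    """Nearest-neighbor upsampling: repeat every value `factor` times per axis."""
    out: FeatureMap = []
    for row in fm:
        wide = [v for v in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def align_to_highest_resolution(pyramid: List[FeatureMap]) -> List[FeatureMap]:
    """Resample every level to the size of the first (highest-resolution) level.

    Assumption: each level's side length divides the first level's side length.
    """
    target = len(pyramid[0])
    aligned = []
    for fm in pyramid:
        factor, rem = divmod(target, len(fm))
        if factor < 1 or rem != 0:
            raise ValueError("level size must divide the target size")
        aligned.append(upsample_nearest(fm, factor))
    return aligned


# Toy three-level pyramid: 4x4, 2x2, and 1x1 maps with made-up values.
pyramid = [
    [[1.0] * 4 for _ in range(4)],
    [[2.0, 3.0], [4.0, 5.0]],
    [[6.0]],
]
aligned = align_to_highest_resolution(pyramid)
shapes = [(len(fm), len(fm[0])) for fm in aligned]
```

After alignment every level has the same spatial size, so the levels could be stacked or fused element-wise. A real detector would operate on multi-channel tensors and would likely use a learned or interpolating resampler instead.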

Pages: 475-489

Page count: 15
