MFFSODNet: Multiscale Feature Fusion Small Object Detection Network for UAV Aerial Images

被引:16
|
作者
Jiang, Lingjie [1 ,2 ,3 ]
Yuan, Baoxi [1 ,2 ,3 ]
Du, Jiawei [1 ]
Chen, Boyu [4 ]
Xie, Hanfei [1 ,2 ,3 ]
Tian, Juan [5 ]
Yuan, Ziqi [6 ]
机构
[1] Xijing Univ, Sch Elect Informat, Xian 710123, Peoples R China
[2] Xijing Univ, Xian Key Lab High Precis Ind Intelligent Vis Measu, Xian 710123, Peoples R China
[3] Shaanxi Jiurui Technol Co Ltd, Xian 710065, Shaanxi, Peoples R China
[4] Air Force Engn Univ, Air Traff Control & Ground Control Intercept Coll, Xian 710038, Peoples R China
[5] Xijing Univ, Sch Humanities & Educ, Xian 710123, Peoples R China
[6] Minzu Univ China, Sch Econ, Beijing 100081, Peoples R China
关键词
Deep learning; feature pyramid network (FPN); multiscale feature extraction; small object detection; unmanned aerial vehicle (UAV) aerial image;
D O I
10.1109/TIM.2024.3381272
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned aerial vehicle (UAV) aerial image object detection is a valuable and challenging research field. Despite the breakthrough of deep learning-based object detection networks in natural scenes, UAV images often exhibit characteristics such as a high proportion of small objects, dense distribution, and significant variations in object scales, posing great challenges for accurate detection. To address these issues, we propose an innovative multiscale feature fusion small object detection network (MFFSODNet). First, concerning the high proportion of small objects in UAV images, an additional tiny object prediction head is introduced instead of the large object prediction head. This approach provides a good detection accuracy of small objects and significantly reduces the parameters. Second, to enhance the feature extraction capability of the network for fine-grained information from small objects, a multiscale feature extraction module (MSFEM) is designed, which could extract rich and valuable multiscale feature information through convolution operation of different scales on multiple branches. Third, to fuse the fine-grained information from shallow feature maps and the semantic information from deep feature maps, a new bidirectional dense feature pyramid network (BDFPN) is proposed. By expanding the feature pyramid network scale and introducing skip connections, BDFPN achieves efficient multiscale information fusion. Extensive experiments on the VisDrone and UAVDT benchmark datasets demonstrate that MFFSODNet outperforms the state-of-the-art object detection methods and further validate the effectiveness and generalization of MFFSODNet on photovoltaic array defect datasets (PVDs).
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [21] Small target detection in drone aerial images based on feature fusion
    Mu, Aiming
    Wang, Huajun
    Meng, Wenjie
    Chen, Yufeng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 585 - 598
  • [22] SFFEF-YOLO: Small object detection network based on fine-grained feature extraction and fusion for unmanned aerial images
    Bai, Chenxi
    Zhang, Kexin
    Jin, Haozhe
    Qian, Peng
    Zhai, Rui
    Lu, Ke
    IMAGE AND VISION COMPUTING, 2025, 156
  • [23] Dilated Convolution and Feature Fusion SSD Network for Small Object Detection in Remote Sensing Images
    Qu, Junsuo
    Su, Chang
    Zhang, Zhiwei
    Razi, Abolfazl
    IEEE ACCESS, 2020, 8 : 82832 - 82843
  • [24] Infrared Small UAV Target Detection Based on Depthwise Separable Residual Dense Network and Multiscale Feature Fusion
    Fang, Houzhang
    Ding, Lan
    Wang, Liming
    Chang, Yi
    Yan, Luxin
    Han, Jinhui
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71 : 1 - 20
  • [25] EVMNet: Eagle visual mechanism-inspired lightweight network for small object detection in UAV aerial images
    Chen, Xi
    Lin, Chuan
    DIGITAL SIGNAL PROCESSING, 2025, 158
  • [26] EFFECTIVE FEATURE FUSION NETWORK IN BIFPN FOR SMALL OBJECT DETECTION
    Chen, Jun
    Mai, HongSheng
    Luo, Linbo
    Chen, Xiaoqiang
    Wu, Kangle
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 699 - 703
  • [27] MFO-Net: A Multiscale Feature Optimization Network for UAV Image Object Detection
    Lan, Ziyang
    Zhuang, Fengyuan
    Lin, Zhijie
    Chen, Riqing
    Wei, Lifang
    Lai, Taotao
    Yang, Changcai
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [28] Attentional single-shot network with multi-scale feature fusion for object detection in aerial images
    Wang, Yusheng
    Wang, Hongzhang
    Tang, Eryong
    Liu, Ye
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4754 - 4758
  • [29] A Non-Local Attention Feature Fusion Network for Multiscale Object Detection
    Wu, Xuke
    Xiong, Gang
    Tian, Bin
    Song, Bing
    Lu, Bo
    Liu, Sheng
    Zhu, Fenghua
    IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2022, 6 : 733 - 738
  • [30] Object Detection For Remote Sensing Image Based on Multiscale Feature Fusion Network
    Tian Tingting
    Yang Jun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (16)