Boundary-aware small object detection with attention and interaction

Cited by: 5
Authors
Feng, Qihan [1]
Shao, Zhiwen [1, 2, 3]
Wang, Zhixiao [1, 2]
Affiliations
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Peoples R China
[2] Minist Educ Peoples Republ China, Engn Res Ctr Mine Digitizat, Xuzhou 221116, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation;
Keywords
Boundary-aware network; Spatial interaction; Attentional feature parallel fusion; Small object detection;
DOI
10.1007/s00371-023-03144-x
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
Object detection is a critical technology for the intelligent analytical processing of images captured by drones. The objects usually come in various scales and can be extremely small. Existing detection methods are inherently based on pyramid hierarchy architectures to extract multi-scale features and provide better feature representation for small objects. Nevertheless, they inevitably dilute the representation of details in low-level features during top-down feature fusion, and they do not consider whether the fused feature fits the objects of specific scales within a layer. Moreover, the pyramid can only implicitly fuse the spatial context, so the fused features cannot receive fine spatial location information for object localization. In this work, we propose an effective boundary-aware network with attention refinement and spatial interaction to tackle the above challenges. Specifically, we first present a simple yet highly effective boundary-aware detection head (BAH), which directly guides representation learning of object structure semantics in the prediction layer to preserve object-related boundary semantics. Additionally, the attentional feature parallel fusion (AFPF) module offers multi-scale feature encoding capability in a parallel triple fusion fashion and adaptively selects features appropriate for objects of certain scales. Furthermore, we design a spatial interactive module (SIM) to preserve fine spatial detail through cross-spatial feature association. Extensive experiments show that the proposed network significantly outperforms the state-of-the-art methods: it achieves 33.1 mAP and 56.5 AP50 on the VisDrone benchmark, and 63.4 mAP and 94 AP50 on the NWPU VHR-10 benchmark. The source code will be released.
Pages: 5921-5934
Page count: 14
Related papers
45 in total
  • [1] SLICING AIDED HYPER INFERENCE AND FINE-TUNING FOR SMALL OBJECT DETECTION
    Akyon, Fatih Cagatay
    Altinuc, Sinan Onur
    Temizel, Alptekin
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 966 - 970
  • [2] Carion N, 2020, European conference on computer vision, P213, DOI 10.1007/978-3-030-58452-8_13
  • [3] Parallel Residual Bi-Fusion Feature Pyramid Network for Accurate Single-Shot Object Detection
    Chen, Ping-Yang
    Chang, Ming-Ching
    Hsieh, Jun-Wei
    Chen, Yong-Sheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9099 - 9111
  • [4] SCPA-Net: Self-calibrated pyramid aggregation for image dehazing
    Chen, Zhihua
    Zhou, Yu
    Li, Ran
    Li, Ping
    Sheng, Bin
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (3-4)
  • [5] GPSD: generative parking spot detection using multi-clue recovery model
    Chen, Zhihua
    Qiu, Jun
    Sheng, Bin
    Li, Ping
    Wu, Enhua
    [J]. VISUAL COMPUTER, 2021, 37 (9-11) : 2657 - 2669
  • [6] Multi-class geospatial object detection and geographic image classification based on collection of part detectors
    Cheng, Gong
    Han, Junwei
    Zhou, Peicheng
    Guo, Lei
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 : 119 - 132
  • [7] Unmanned aerial systems for photogrammetry and remote sensing: A review
    Colomina, I.
    Molina, P.
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 92 : 79 - 97
  • [8] Dong Zhang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12373), P323, DOI 10.1007/978-3-030-58604-1_20
  • [9] Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images
    Du, Bowei
    Huang, Yecheng
    Chen, Jiaxin
    Huang, Di
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13435 - 13444
  • [10] Coarse-grained Density Map Guided Object Detection in Aerial Images
    Duan, Chengzhen
    Wei, Zhiwei
    Zhang, Chi
    Qu, Siying
    Wang, Hongpeng
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2789 - 2798