Joint-attention feature fusion network and dual-adaptive NMS for object detection

被引:38
作者
Ma, Wentao [1 ]
Zhou, Tongqing [1 ]
Qin, Jiaohua [2 ]
Zhou, Qingyang [2 ]
Cai, Zhiping [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
[2] Cent South Univ Forestry & Technol, Coll Comp Sci & Informat Technol, Changsha 410004, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Joint-attention; Adaptive NMS;
D O I
10.1016/j.knosys.2022.108213
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attention mechanisms and Non-Maximum Suppression (NMS) have proven to be effective components in object detection. However, feature fusion of different scales and layers based on a single attention mechanism cannot always yield gratifying performance, and may introduce redundant information that makes the results worse than expected. NMS methods, on the other hand, generally face the single-constant threshold dilemma, namely, a lower threshold leads to the miss of highly overlapped instance objects while a higher one brings in more false positives. Therefore, how to optimize different dimensions of correlation in feature mapping and how to adaptively set the NMS threshold still hinder effective object detection. While independently addressing each will cause suboptimal detection, this paper proposes to feed the informative feature representation from a joint-attention feature fusion network into adaptive NMS for a comprehensive performance enhancement. Specifically, we embed two types of attention modules in a three-level Feature Pyramid Network (FPN): the channel-attention module is adopted for enhanced feature representation by re-evaluating relationships between channels from a global perspective; the position-attention module is used to exploit the correlation between features to discover rich contextual feature information. Furthermore, we develop dual-adaptive NMS to dynamically adjust the suppression thresholds according to instance objects density, namely, the threshold rises as instance objects gather and decays when objects appear sparsely. The proposed method is evaluated on the COCO dataset and extensive experimental results demonstrate its superior performance compared with existing methods. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Infrared Maritime Object Detection Network With Feature Enhancement and Adjacent Fusion
    Zhang, Meng
    Dong, Lili
    Gao, Yulin
    Wang, Yichen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 5750 - 5760
  • [42] MLFFNet: Multilevel Feature Fusion Network for Object Detection in Sonar Images
    Wang, Zhen
    Guo, Jianxin
    Zeng, Leya
    Zhang, Chuanlei
    Wang, Buhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [43] FULLY CONVOLUTIONAL NETWORK WITH DENSELY FEATURE FUSION MODELS FOR OBJECT DETECTION
    Huang, Shouzhi
    Li, Xiaoyu
    Jiang, Zhuqing
    Guo, Xiaoqiang
    Men, Aidong
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [44] Multi-level feature fusion pyramid network for object detection
    Zebin Guo
    Hui Shuai
    Guangcan Liu
    Yisheng Zhu
    Wenqing Wang
    The Visual Computer, 2023, 39 : 4267 - 4277
  • [45] Progressive Feature Fusion and Refinement Network for Substation Rotating Object Detection
    Qu, Luyao
    Zhu, Xinshan
    Li, Bin
    Guo, Zhimin
    Liu, Hao
    Mao, Wandeng
    2022 4TH INTERNATIONAL CONFERENCE ON SMART POWER & INTERNET ENERGY SYSTEMS, SPIES, 2022, : 2356 - 2360
  • [46] Joint Spatial and Temporal Feature Enhancement Network for Disturbed Object Detection
    Zhang, Fan
    Ji, Hongbing
    Zhang, Yongquan
    Zhu, Zhigang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12258 - 12273
  • [47] Progressive Dual-Attention Residual Network for Salient Object Detection
    Zhang, Liqian
    Zhang, Qing
    Zhao, Rui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 5902 - 5915
  • [48] Sequential Feature Fusion for Object Detection
    Wang, Qiang
    Han, Yahong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 689 - 699
  • [49] Multi-Scale Feature Fusion Based Adaptive Object Detection for UAV
    Liu Fang
    Wu Zhiwei
    Yang Anzhe
    Han Xiao
    ACTA OPTICA SINICA, 2020, 40 (10)
  • [50] ALFPN: Adaptive Learning Feature Pyramid Network for Small Object Detection
    Chen, Haolin
    Wang, Qi
    Ruan, Weijian
    Zhu, Jingxiang
    Lei, Liang
    Wu, Xue
    Hao, Gefei
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023