Learning Discriminated Features Based on Feature Pyramid Networks and Attention for Multi-scale Object Detection

Cited by: 2
Authors
Lu, Yunhua [1]
Su, Minghui [1]
Wang, Yong [1]
Liu, Zhi [1]
Peng, Tao [1]
Affiliations
[1] Chongqing University of Technology, School of Artificial Intelligence, Chongqing 400054, People's Republic of China
Keywords
Object detection; Multi-scale; Feature pyramid; Discriminative learning; Attention mechanism;
DOI
10.1007/s12559-022-10052-0
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As object detection scenes grow more complex, the extracted feature information needs further improvement. Many multi-scale feature pyramid network methods have been proposed to improve detection accuracy. However, most of them follow a simple chain aggregation structure and therefore do not account for the differences between objects at different scales. Modern cognitive research suggests that human cognition is not a simple image-based matching process but involves an inherent process of information decomposition and reconstruction. Inspired by this theory, a new feature pyramid network model based on discriminative learning, denoted SuFPN, is proposed for multi-scale object detection. SuFPN fully considers the correlation between low-level location information and deep feature information. First, object features are extracted through a top-down path and lateral connections. Then, deformable convolution is used to extract discriminative spatial information about objects. Finally, an attention mechanism is introduced to generate a discriminative feature map with enhanced spatial and channel interdependence, which provides excellent location information for the feature pyramid while retaining semantic information. The proposed SuFPN is validated on the PASCAL VOC and COCO datasets. The Average Precision (AP) reaches 80.0 on PASCAL VOC, 1.7 points higher than the feature pyramid network (FPN), and 39.2 on COCO, 1.8 points higher than FPN. The results demonstrate that SuFPN outperforms other advanced methods in multi-scale detection precision.
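The first and last stages named in the abstract (top-down path with lateral connections, then attention) can be illustrated with a minimal sketch. This is not the SuFPN implementation: the abstract gives no layer details, so every function name below is hypothetical, feature maps are plain nested lists, the deformable-convolution stage is omitted, and the attention step is a weight-free squeeze-and-excitation-style gate used only as a stand-in.

```python
import math


def upsample2x(fmap):
    """Nearest-neighbour 2x upsampling of a 2D map given as a list of rows."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each value horizontally
        out.append(wide)
        out.append(list(wide))                     # repeat the row vertically
    return out


def add_maps(a, b):
    """Element-wise sum of two 2D maps of equal size."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def top_down_merge(laterals):
    """Generic FPN-style merge (assumed, not taken from the paper).

    `laterals` is ordered fine -> coarse, each map half the size of the
    previous one. Each level is its lateral map plus the upsampled
    merged map from the next coarser level.
    """
    merged = [laterals[-1]]  # the coarsest level passes through unchanged
    for lateral in reversed(laterals[:-1]):
        merged.append(add_maps(lateral, upsample2x(merged[-1])))
    return merged[::-1]      # restore fine -> coarse order


def channel_attention(channels):
    """Stand-in channel gate: scale each channel by sigmoid(global mean).

    A real attention module would use learned weights; none are used here.
    """
    out = []
    for ch in channels:
        mean = sum(map(sum, ch)) / (len(ch) * len(ch[0]))
        gate = 1.0 / (1.0 + math.exp(-mean))
        out.append([[v * gate for v in row] for row in ch])
    return out


# Toy single-channel pyramid: 4x4, 2x2, and 1x1 maps.
c3 = [[1, 1, 1, 1] for _ in range(4)]
c4 = [[2, 2], [2, 2]]
c5 = [[3]]
p3, p4, p5 = top_down_merge([c3, c4, c5])
```

In this toy pyramid the coarse value 3 propagates downward, so `p4` holds 5 everywhere and `p3` holds 6 everywhere, which shows how semantic information from deep levels reaches the high-resolution maps that carry location detail.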
Pages: 486-495
Number of pages: 10