Multi-scale Vertical Cross-layer Feature Aggregation and Attention Fusion Network for Object Detection

被引:2
|
作者
Gao, Wenting [1 ]
Li, Xiaojuan [1 ]
Han, Yu [1 ]
Liu, Yue [1 ]
机构
[1] Beijing Inst Technol, Sch Opt & Photon, Beijing Engn Res Ctr Mixed Real & Adv Display, Beijing 100081, Peoples R China
关键词
Deep learning; Object detection; Attention mechanism;
D O I
10.1007/978-3-031-15937-4_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scale imbalance is one of the primary limitations for object detection. To tackle such a problem, existing methods such as FPN usually integrate the features at different scales, which suffers from the inconsistence of different high-level and low-level features due to the straightforward combination. In this paper, we propose a multi-scale vertical cross-layer feature aggregation and attention fusion network which not only has bottom-up and top-down pathways with lateral connections, but also adds cross-layer paths in the vertical direction. The proposed method can boost information flow and shorten the information path between high-level and low-level features. An attention fusion module is also introduced to obtain the internal correlation between local, global and contextual information of other feature layers. In order to optimize the anchor configurations, a differential evolution algorithm is employed to reconfigure the ratios and scales of anchors. Experimental results show that the proposed method achieves superior detection performance on the public dataset PASCAL VOC.
引用
收藏
页码:139 / 150
页数:12
相关论文
共 50 条
  • [1] Cross-Layer Feature Attention Module for Multi-scale Object Detection
    Zheng, Haotian
    Pang, Cheng
    Lan, Rushi
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2022, PT II, 2022, 1701 : 202 - 210
  • [2] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [3] Multi-scale cross-layer fusion and center position network for pedestrian detection
    Liu, Qian
    Qi, Youwei
    Wang, Cunbao
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (01)
  • [4] CFANet: A Cross-layer Feature Aggregation Network for Camouflaged Object Detection
    Zhang, Qing
    Yan, Weiqi
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2441 - 2446
  • [5] Small Object Detection using Multi-scale Feature Fusion and Attention
    Liu, Baokai
    Du, Shiqiang
    Li, Jiacheng
    Wang, Jianhua
    Liu, Wenjie
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7246 - 7251
  • [6] A Multi-Feature Fusion and Attention Network for Multi-Scale Object Detection in Remote Sensing Images
    Cheng, Yong
    Wang, Wei
    Zhang, Wenjie
    Yang, Ling
    Wang, Jun
    Ni, Huan
    Guan, Tingzhao
    He, Jiaxin
    Gu, Yakang
    Tran, Ngoc Nguyen
    REMOTE SENSING, 2023, 15 (08)
  • [7] An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network
    Qu, Zhong
    Gao, Le-yuan
    Wang, Sheng-ye
    Yin, Hao-nan
    Yi, Tu-ming
    IMAGE AND VISION COMPUTING, 2022, 125
  • [8] Multi-Scale Residual Aggregation Feature Pyramid Network for Object Detection
    Wang, Hongyang
    Wang, Tiejun
    ELECTRONICS, 2023, 12 (01)
  • [9] MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK
    Guan, Wenjie
    Zou, YueXian
    Zhou, Xiaoqun
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2596 - 2600
  • [10] Multi-Scale Object Detection Using Feature Fusion Recalibration Network
    Guo, Ziyuan
    Zhang, Weimin
    Liang, Zhenshuo
    Shi, Yongliang
    Huang, Qiang
    IEEE ACCESS, 2020, 8 : 51664 - 51673