Multi-Level Refinement Feature Pyramid Network for Scale Imbalance Object Detection

被引:3
|
作者
Aziz, Lubna [1 ,2 ]
Salam, Md Sah Bin Haji [1 ]
Sheikh, Usman Ullah [3 ]
Khan, Surat [4 ]
Ayub, Huma [5 ]
Ayub, Sara [4 ]
机构
[1] Univ Teknol Malaysia UTM, Sch Comp, Div Art Intelligence, Fac Engn, Skudai 81310, Kagawa, Malaysia
[2] Balochistan Univ Informat Technol Engn & Manageme, Fac Informat & Commun Technol, Dept Comp Engn, Quetta 87300, Pakistan
[3] Univ Teknol Malaysia UTM, Sch Elect Engn, Fac Engn, Skudai 81310, Kagawa, Malaysia
[4] Balochistan Univ Informat Technol Engn & Manageme, Fac Informat & Commun Technol, Dept Elect Engn, Quetta 87300, Pakistan
[5] Sardar Bahadur Khan Woman Univ, Dept Chem & Technol, Quetta 86301, Pakistan
关键词
Object detection; feature pyramid; convolutional neural network; computer vision;
D O I
10.1109/ACCESS.2021.3130129
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Object detection becomes a challenge due to diversity of object scales. In general, modern object detectors use feature pyramid to learn multi-scale representation for better results. However, current versions of feature pyramid are insufficient to handle scale imbalance, as it is inefficient to integrate semantic information across different scales. Here, we reformulate feature pyramid construction as a feature reconfiguration process. We propose a detection network, Multi-level Refinement Feature pyramid Network, to combine high-level features (i.e., semantic information), middle-level feature and low-level feature (i.e., boundary information), in a highly-nonlinear yet efficient manner. A novel contextual features module is proposed, which consists of global attention and local reconfigurations. It efficiently gathers task-oriented contextual features across different scales and spatial locations (i.e., lightweight local reconfiguration and global attention). To evaluate significance of proposed model, we designed and trained end-to-end single stage detector called MRFDet by assimilating it into Single Shot Detector (SSD), and it achieved better detection performance compared to most recent single-stage objects detectors. MRFDet achieves an AP of 45.2 with MS-COCO and an improvement in mAP of 4.5% with VOC.
引用
收藏
页码:156492 / 156506
页数:15
相关论文
共 50 条
  • [1] Multi-level refinement enriched feature pyramid network for object detection
    Aziz, Lubna
    Salam, Md. Sah Bin Haji F. C.
    Ayub, Sara
    IMAGE AND VISION COMPUTING, 2021, 115
  • [2] Multi-level refinement enriched feature pyramid network for object detection
    Aziz, Lubna
    FC, Md. Sah Bin Haji Salam
    Ayub, Sara
    Image and Vision Computing, 2021, 115
  • [3] Multi-level feature fusion pyramid network for object detection
    Guo, Zebin
    Shuai, Hui
    Liu, Guangcan
    Zhu, Yisheng
    Wang, Wenqing
    VISUAL COMPUTER, 2023, 39 (09): : 4267 - 4277
  • [4] Multi-level feature fusion pyramid network for object detection
    Zebin Guo
    Hui Shuai
    Guangcan Liu
    Yisheng Zhu
    Wenqing Wang
    The Visual Computer, 2023, 39 : 4267 - 4277
  • [5] Feature refinement with multi-level context for object detection
    Yingdong Ma
    Yanan Wang
    Machine Vision and Applications, 2023, 34
  • [6] Feature refinement with multi-level context for object detection
    Ma, Yingdong
    Wang, Yanan
    MACHINE VISION AND APPLICATIONS, 2023, 34 (04)
  • [7] MLA-Net: Feature Pyramid Network with Multi-Level Local Attention for Object Detection
    Yang, Xiaobao
    Wang, Wentao
    Wu, Junsheng
    Ding, Chen
    Ma, Sugang
    Hou, Zhiqiang
    MATHEMATICS, 2022, 10 (24)
  • [8] Multi-Scale Residual Aggregation Feature Pyramid Network for Object Detection
    Wang, Hongyang
    Wang, Tiejun
    ELECTRONICS, 2023, 12 (01)
  • [9] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [10] Multi-level feature enhancement network for object detection in sonar images
    Zhou, Xin
    Zhou, Zihan
    Wang, Manying
    Ning, Bo
    Wang, Yanhao
    Zhu, Pengli
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100