CHANNEL-SPATIAL FUSION AWARE NET FOR ACCURATE AND FAST OBJECT DETECTION

被引:0
作者
Wu, Linhuang [1 ]
Yang, Xiujun [1 ]
Fan, Zhenjia [1 ]
Wang, Chunjun [2 ]
Chen, Zhifeng [1 ]
机构
[1] Fuzhou Univ, Coll Phys & Informat, Fuzhou, Peoples R China
[2] Univ Chicago, Chicago, IL 60637 USA
来源
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2020年
关键词
object detection; fusion awareness;
D O I
暂无
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
A major challenge of object detection is that accurate detector is limited by speed due to enormous network, while the lightweight detector can reach real-time but its weak representation ability leads to the expense of accuracy. To overcome the issue, we propose a channel-spatial fusion awareness module (CSFA) to improve the accuracy by enhancing the feature representation of network at the negligible cost of complexity. Given a feature map, our method exploits two parts sequentially, channel awareness and spatial awareness, to reconstruct feature map without deepening the network. Because of the property of CSFA for easy integrating into any layer of CNN architectures, we assemble this module into ResNet-18 and DLA-34 in CenterNet to form a CSFA detector. Results consistently show that CSFA-Net runs in a fairly fast speed, and achieves state-of-the-art, i.e., mAP of 81.12% on VOC2007 and AP of 43.2% on COCO.
引用
收藏
页码:758 / 762
页数:5
相关论文
共 22 条
  • [1] [Anonymous], 2017, FSSD FEATURE FUSION
  • [2] Chen S., 2019, ARXIV190406883
  • [3] Dai JF, 2016, ADV NEUR IN, V29
  • [4] Deformable Convolutional Networks
    Dai, Jifeng
    Qi, Haozhi
    Xiong, Yuwen
    Li, Yi
    Zhang, Guodong
    Hu, Han
    Wei, Yichen
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 764 - 773
  • [5] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [6] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [7] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
  • [10] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90