SAN: Learning Relationship Between Convolutional Features for Multi-scale Object Detection

被引:37
|
作者
Kim, Yonghyun [1 ]
Kang, Bong-Nam [2 ]
Kim, Daijin [1 ]
机构
[1] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
[2] POSTECH, Dept Creat IT Engn, Pohang, South Korea
来源
COMPUTER VISION - ECCV 2018, PT V | 2018年 / 11209卷
关键词
Scale Aware Network; Object detection; Multi scale; Neural network; STATISTICS;
D O I
10.1007/978-3-030-01228-1_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the recent successful methods in accurate object detection build on the convolutional neural networks (CNN). However, due to the lack of scale normalization in CNN-based detection methods, the activated channels in the feature space can be completely different according to a scale and this difference makes it hard for the classifier to learn samples. We propose a Scale Aware Network (SAN) that maps the convolutional features from the different scales onto a scale-invariant subspace to make CNN-based detection methods more robust to the scale variation, and also construct a unique learning method which considers purely the relationship between channels without the spatial information for the efficient learning of SAN. To show the validity of our method, we visualize how convolutional features change according to the scale through a channel activation matrix and experimentally show that SAN reduces the feature differences in the scale space. We evaluate our method on VOC PASCAL and MS COCO dataset. We demonstrate SAN by conducting several experiments on structures and parameters. The proposed SAN can be generally applied to many CNN-based detection methods to enhance the detection accuracy with a slight increase in the computing time.
引用
收藏
页码:328 / 343
页数:16
相关论文
共 50 条
  • [1] Selective Multi-scale Learning for Object Detection
    Chen, Junliang
    Lu, Weizeng
    Shen, Linlin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 3 - 14
  • [2] MULTI-SCALE REINFORCEMENT LEARNING STRATEGY FOR OBJECT DETECTION
    Luo, Yihao
    Cao, Xiang
    Zhang, Juntao
    Pan, Leixilan
    Wang, Tianjiang
    Feng, Qi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2015 - 2019
  • [3] Learning Discriminated Features Based on Feature Pyramid Networks and Attention for Multi-scale Object Detection
    Lu, Yunhua
    Su, Minghui
    Wang, Yong
    Liu, Zhi
    Peng, Tao
    COGNITIVE COMPUTATION, 2023, 15 (02) : 486 - 495
  • [4] Learning Discriminated Features Based on Feature Pyramid Networks and Attention for Multi-scale Object Detection
    Yunhua Lu
    Minghui Su
    Yong Wang
    Zhi Liu
    Tao Peng
    Cognitive Computation, 2023, 15 : 486 - 495
  • [5] MULTI-SCALE SHARED FEATURES FOR CASCADE OBJECT DETECTION
    Lin, Zhe
    Hua, Gang
    Davis, Larry S.
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1865 - 1868
  • [6] A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection
    Cai, Zhaowei
    Fan, Quanfu
    Feris, Rogerio S.
    Vasconcelos, Nuno
    COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 354 - 370
  • [7] Deep Learning for Multi-scale Object Detection: A Survey
    Chen K.-Q.
    Zhu Z.-L.
    Deng X.-M.
    Ma C.-X.
    Wang H.-A.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (04): : 1201 - 1227
  • [8] MsRi-CCF: Multi-Scale and Rotation-Insensitive Convolutional Channel Features for Geospatial Object Detection
    Wu, Xin
    Hong, Danfeng
    Ghamisi, Pedram
    Li, Wei
    Tao, Ran
    REMOTE SENSING, 2018, 10 (12)
  • [9] Multi-scale object detection in remote sensing imagery with convolutional neural networks
    Deng, Zhipeng
    Sun, Hao
    Zhou, Shilin
    Zhao, Juanping
    Lei, Lin
    Zou, Huanxin
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 : 3 - 22
  • [10] Multi-scale Dilated Convolutional Neural Network for Object Detection in UAV Images
    Zhang R.
    Shao Z.
    Aleksei P.
    Wang J.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2020, 45 (06): : 895 - 903