SAN: Learning Relationship Between Convolutional Features for Multi-scale Object Detection

被引:37
|
作者
Kim, Yonghyun [1 ]
Kang, Bong-Nam [2 ]
Kim, Daijin [1 ]
机构
[1] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
[2] POSTECH, Dept Creat IT Engn, Pohang, South Korea
来源
COMPUTER VISION - ECCV 2018, PT V | 2018年 / 11209卷
关键词
Scale Aware Network; Object detection; Multi scale; Neural network; STATISTICS;
D O I
10.1007/978-3-030-01228-1_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the recent successful methods in accurate object detection build on the convolutional neural networks (CNN). However, due to the lack of scale normalization in CNN-based detection methods, the activated channels in the feature space can be completely different according to a scale and this difference makes it hard for the classifier to learn samples. We propose a Scale Aware Network (SAN) that maps the convolutional features from the different scales onto a scale-invariant subspace to make CNN-based detection methods more robust to the scale variation, and also construct a unique learning method which considers purely the relationship between channels without the spatial information for the efficient learning of SAN. To show the validity of our method, we visualize how convolutional features change according to the scale through a channel activation matrix and experimentally show that SAN reduces the feature differences in the scale space. We evaluate our method on VOC PASCAL and MS COCO dataset. We demonstrate SAN by conducting several experiments on structures and parameters. The proposed SAN can be generally applied to many CNN-based detection methods to enhance the detection accuracy with a slight increase in the computing time.
引用
收藏
页码:328 / 343
页数:16
相关论文
共 50 条
  • [31] DYNAMIC MULTI-SCALE LOSS BALANCE FOR OBJECT DETECTION
    Luo, Yihao
    Cao, Xiang
    Zhang, Juntao
    Cheng, Peng
    Wang, Tianjiang
    Feng, Qi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4873 - 4877
  • [32] StairsNet: Mixed Multi-scale Network for Object Detection
    Gao, Weiyi
    Cao, Wenlong
    Zhai, Jian
    Rui, Jianwu
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 303 - 314
  • [33] Dynamic multi-scale loss optimization for object detection
    Yihao Luo
    Xiang Cao
    Juntao Zhang
    Peng Cheng
    Tianjiang Wang
    Qi Feng
    Multimedia Tools and Applications, 2023, 82 : 2349 - 2367
  • [34] AUTONOMOUS MULTI-SCALE OBJECT DETECTION WITH HOUGH FORESTS
    Scalzo, Maria
    Velipasalar, Senem
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1643 - 1647
  • [35] Multi-scale redistribution feature pyramid for object detection
    Qian, Huifang
    Guo, Jiahao
    Zhou, Xuan
    AI COMMUNICATIONS, 2022, 35 (01) : 15 - 30
  • [36] MGFPN: Enhancing multi-scale feature for object detection
    He, Weiming
    Wu, You
    Xiao, Jing
    Cao, Yang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (06) : 11171 - 11181
  • [37] Multi-scale coupled attention for visual object detection
    Li, Fei
    Yan, Hongping
    Shi, Linsu
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [38] Object Detection Using Multi-Scale Balanced Sampling
    Yu, Hang
    Gong, Jiulu
    Chen, Derong
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [39] MRMNet: Multi-scale residual multi-branch neural network for object detection
    Dong, Yongsheng
    Liu, Yafeng
    Li, Xuelong
    NEUROCOMPUTING, 2024, 596
  • [40] Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images
    Fu, Kun
    Chang, Zhonghan
    Zhang, Yue
    Xu, Guangluan
    Zhang, Keshu
    Sun, Xian
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 161 (161) : 294 - 308