Scene understanding based on Multi-Scale Pooling of deep learning features

被引:0
|
作者
Li, DongYang [1 ]
Zhou, Yue [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
来源
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING | 2015年 / 124卷
关键词
CNNs; MOP-CNN; SPP-net; Scenes understanding;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep convolutional neural networks (CNNs) have recently shown impressive performance as generic representation for recognition. However, the feature extracted from global CNNs lack geometric invariance, which limits their robustness for classification and detection of highly variable objects. To improve the invariance of the features without degrading their discriminative power and speed up the calculation, we follow the next two method. Firstly, we adopt the scheme called multi-scale orderless pooling (MOP-CNN) which extracts CNNs activation from local patches of the image at multiple scale levels, performs orderless VLAD pooling of these activations at each level separately, and concatenates the result. Second, to speed up the calculation, we adapt the SPP-net as the CNNs architecture. Using SPP-net, we compute the feature maps from the entire image only once, and then pool features in arbitrary regions (sub-images) to generate fixed-length representations for training the detectors. This method avoids repeatedly computing the convolutional features. On the challenging SUN397 Scenes classification datasets, our method achieves competitive classification results.
引用
收藏
页码:1732 / 1737
页数:6
相关论文
共 50 条
  • [1] Multi-scale Orderless Pooling of Deep Convolutional Activation Features
    Gong, Yunchao
    Wang, Liwei
    Guo, Ruiqi
    Lazebnik, Svetlana
    COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 : 392 - 407
  • [2] Outdoor Scene Understanding Based on Multi-Scale PBA Image Features and Point Cloud Features
    Liu, Yisha
    Gu, Yufeng
    Yan, Fei
    Zhuang, Yan
    SENSORS, 2019, 19 (20)
  • [3] A CNN-Based Multi-Scale Pooling Strategy for Acoustic Scene Classification
    Huang, Rong
    Xie, Yue
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (01) : 153 - 156
  • [4] Scene Text Detection Based on Multi-Scale Pooling and Bidirectional Feature Fusion
    Wei, Zheliang
    Li, Yueyang
    Luo, Haichi
    Computer Engineering and Applications, 2024, 60 (02) : 154 - 161
  • [5] Understanding Mobility via Deep Multi-Scale Learning
    Zhang, Rui
    Xie, Peng
    Wang, Chen
    Liu, Gaoyang
    Wan, Shaohua
    2018 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS, 2019, 147 : 487 - 494
  • [6] Scene classification of remote sensing image based on deep network and multi-scale features fusion
    Yang, Zhou
    Mu, Xiao-dong
    Zhao, Feng-an
    OPTIK, 2018, 171 : 287 - 293
  • [7] Deep learning resistivity inversion method based on multi-scale edge features
    Liu Zheng-yu
    Pang Yong-hao
    Zhang Feng-kai
    Wan Yong
    Liu Lei
    Cai Yu-mei
    Liu Jia-wen
    ROCK AND SOIL MECHANICS, 2023, 44 (11) : 3299 - 3306
  • [8] No-Reference Image Quality Assessment with Multi-Scale Orderless Pooling of Deep Features
    Varga, Domonkos
    JOURNAL OF IMAGING, 2021, 7 (07)
  • [9] Multi-scale Pyramid Pooling for Deep Convolutional Representation
    Yoo, Donggeun
    Park, Sunggyun
    Lee, Joon-Young
    Kweon, In So
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [10] Multi-scale stacking attention pooling for remote sensing scene classification
    Bi, Qi
    Zhang, Han
    Qin, Kun
    NEUROCOMPUTING, 2021, 436 : 147 - 161