A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification

被引:57
|
作者
Hayat, Munawar [1 ]
Khan, Salman H. [2 ,3 ]
Bennamoun, Mohammed [4 ]
An, Senjian [4 ]
机构
[1] Univ Canberra, Bruce, ACT 2617, Australia
[2] CSIRO, Data61, Canberra, ACT 0200, Australia
[3] Australian Natl Univ, Canberra, ACT 0200, Australia
[4] Univ Western Australia, Crawley, WA 6009, Australia
基金
澳大利亚研究理事会;
关键词
Indoor scenes classification; spatial layout variations; scale invariance;
D O I
10.1109/TIP.2016.2599292
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unlike standard object classification, where the image to be classified contains one or multiple instances of the same object, indoor scene classification is quite different since the image consists of multiple distinct objects. Furthermore, these objects can be of varying sizes and are present across numerous spatial locations in different layouts. For automatic indoor scene categorization, large-scale spatial layout deformations and scale variations are therefore two major challenges and the design of rich feature descriptors which are robust to these challenges is still an open problem. This paper introduces a new learnable feature descriptor called "spatial layout and scale invariant convolutional activations" to deal with these challenges. For this purpose, a new convolutional neural network architecture is designed which incorporates a novel "spatially unstructured" layer to introduce robustness against spatial layout deformations. To achieve scale invariance, we present a pyramidal image representation. For feasible training of the proposed network for images of indoor scenes, this paper proposes a methodology, which efficiently adapts a trained network model (on a large-scale data) for our task with only a limited amount of available training data. The efficacy of the proposed approach is demonstrated through extensive experiments on a number of data sets, including MIT-67, Scene-15, Sports-8, Graz-02, and NYU data sets.
引用
收藏
页码:4829 / 4841
页数:13
相关论文
共 50 条
  • [21] Deep feature extraction and motion representation for satellite video scene classification
    Yanfeng Gu
    Huan Liu
    Tengfei Wang
    Shengyang Li
    Guoming Gao
    Science China Information Sciences, 2020, 63
  • [22] Classification of Indoor/Outdoor Scene
    Raja, R.
    Roomi, S. Md. Mansoor
    Dharmalakshmi, D.
    Rohini, S.
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2013, : 735 - 738
  • [23] Extrapolating spatial layout in scene representations
    Monica S. Castelhano
    Alexander Pollatsek
    Memory & Cognition, 2010, 38 : 1018 - 1025
  • [24] FINGERPRINT CLASSIFICATION USING SCALE-INVARIANT FEATURE TRANSFORM
    Sobek, Jakub
    Cetnarowicz, Damian
    Dabrowski, Adam
    SPA 2011: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2011, : 100 - 105
  • [25] Extrapolating spatial layout in scene representations
    Castelhano, Monica S.
    Pollatsek, Alexander
    MEMORY & COGNITION, 2010, 38 (08) : 1018 - 1025
  • [26] Rapid Invariant Encoding of Scene Layout in Human OPA
    Henriksson, Linda
    Mur, Marieke
    Kriegeskorte, Nikolaus
    NEURON, 2019, 103 (01) : 161 - +
  • [27] Spatial-Temporal Invariant Contrastive Learning for Remote Sensing Scene Classification
    Huang, Haozhe
    Mou, Zhongfeng
    Li, Yunying
    Li, Qiujun
    Chen, Jie
    Li, Haifeng
    IEEE Geoscience and Remote Sensing Letters, 2022, 19
  • [28] SCENE IMAGE CLASSIFICATION USING REDUCED VIRTUAL FEATURE REPRESENTATION IN SPARSE FRAMEWORK
    Sharma, Krishan
    Gupta, Shikha
    Dileep, A. D.
    Rameshan, Renu
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2701 - 2705
  • [29] Spatial-Temporal Invariant Contrastive Learning for Remote Sensing Scene Classification
    Huang, Haozhe
    Mou, Zhongfeng
    Li, Yunying
    Li, Qiujun
    Chen, Jie
    Li, Haifeng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [30] Feature Extraction and Scene Classification for Remote Sensing Image Based on Sparse Representation
    Guo, Youliang
    Zhang, Junping
    Zhong, Shengwei
    ALGORITHMS, TECHNOLOGIES, AND APPLICATIONS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGERY XXV, 2019, 10986