A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification

被引:57
|
作者
Hayat, Munawar [1 ]
Khan, Salman H. [2 ,3 ]
Bennamoun, Mohammed [4 ]
An, Senjian [4 ]
机构
[1] Univ Canberra, Bruce, ACT 2617, Australia
[2] CSIRO, Data61, Canberra, ACT 0200, Australia
[3] Australian Natl Univ, Canberra, ACT 0200, Australia
[4] Univ Western Australia, Crawley, WA 6009, Australia
基金
澳大利亚研究理事会;
关键词
Indoor scenes classification; spatial layout variations; scale invariance;
D O I
10.1109/TIP.2016.2599292
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unlike standard object classification, where the image to be classified contains one or multiple instances of the same object, indoor scene classification is quite different since the image consists of multiple distinct objects. Furthermore, these objects can be of varying sizes and are present across numerous spatial locations in different layouts. For automatic indoor scene categorization, large-scale spatial layout deformations and scale variations are therefore two major challenges and the design of rich feature descriptors which are robust to these challenges is still an open problem. This paper introduces a new learnable feature descriptor called "spatial layout and scale invariant convolutional activations" to deal with these challenges. For this purpose, a new convolutional neural network architecture is designed which incorporates a novel "spatially unstructured" layer to introduce robustness against spatial layout deformations. To achieve scale invariance, we present a pyramidal image representation. For feasible training of the proposed network for images of indoor scenes, this paper proposes a methodology, which efficiently adapts a trained network model (on a large-scale data) for our task with only a limited amount of available training data. The efficacy of the proposed approach is demonstrated through extensive experiments on a number of data sets, including MIT-67, Scene-15, Sports-8, Graz-02, and NYU data sets.
引用
收藏
页码:4829 / 4841
页数:13
相关论文
共 50 条
  • [31] Motion Planning for Convertible Indoor Scene Layout Design
    Xiong, Guoming
    Fu, Qiang
    Fu, Hongbo
    Zhou, Bin
    Luo, Guoliang
    Deng, Zhigang
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (12) : 4413 - 4424
  • [32] Indoor Scene Layout Estimation from a Single Image
    Lin, Hung Jin
    Huang, Sheng-Wei
    Lai, Shang-Hong
    Chiang, Chen-Kuo
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 842 - 847
  • [33] A Deep Scene Representation for Aerial Scene Classification
    Zheng, Xiangtao
    Yuan, Yuan
    Lu, Xiaoqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (07): : 4799 - 4809
  • [34] Deep-Learning based Global and Semantic Feature Fusion for Indoor Scene Classification
    Pereira, Ricardo
    Goncalves, Nuno
    Garrote, Luis
    Barros, Tiago
    Lopes, Ana
    Nunes, Urbano J.
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 67 - 73
  • [35] Scene Classification with the Discriminative Representation
    Sun, Hao
    Chen, Yaxiong
    Chen, Wenjing
    Huang, Zhangcan
    2017 2ND INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP), 2017, : 267 - 271
  • [36] Classification of Histopathological Images using Scale-Invariant Feature Transform
    Bukala, Andrzej
    Cyganek, Boguslaw
    Koziarski, Michal
    Kwolek, Bogdan
    Olborski, Boguslaw
    Antosz, Zbigniew
    Swadzba, Jakub
    Sitkowski, Piotr
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 506 - 512
  • [37] Robust Indoor/Outdoor Scene Classification
    Raja, R.
    Roomi, S. Md. Mansoor
    Dharmalakshmi, D.
    2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 203 - +
  • [38] Spectral-Spatial Scale Invariant Feature Transform for Hyperspectral Images
    Al-khafaji, Suhad Lateef
    Zhou, Jun
    Zia, Ali
    Liew, Alan Wee-Chung
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (02) : 837 - 850
  • [39] A benchmark for indoor/outdoor scene classification
    Payne, A
    Singh, S
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2005, 3687 : 711 - 718
  • [40] RETRACTED ARTICLE: A new deep representation for large-scale scene classification
    Bo Dai
    Feng Mei
    Deliang Ji
    Caiyou Zhang
    Jia Shi
    Multimedia Tools and Applications, 2020, 79 : 9689 - 9689