A deep learning-based global and segmentation-based semantic feature fusion approach for indoor scene classification

被引:2
作者
Pereira, Ricardo [1 ]
Barros, Tiago [1 ]
Garrote, Luis [1 ]
Lopes, Ana [1 ,2 ]
Nunes, Urbano J. [1 ]
机构
[1] Univ Coimbra, Inst Syst & Robot, Dept Elect & Comp Engn, Rua Silvio Lima Polo II, P-3030290 Coimbra, Portugal
[2] Polytech Inst Tomar, P-2300313 Tomar, Portugal
关键词
Indoor scene classification; Scene representation; Visual recognition; Global and local features; Segmentation-based features; NETWORK;
D O I
10.1016/j.patrec.2024.01.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work proposes a novel approach that uses a semantic segmentation mask to obtain a 2D spatial layout of the segmentation-categories across the scene, designated by segmentation-based semantic features (SSFs). These features represent, per segmentation-category, the pixel count, as well as the 2D average position and respective standard deviation values. Moreover, a two-branch network, GS2F2App, that exploits CNN-based global features extracted from RGB images and the segmentation-based features extracted from the proposed SSFs, is also proposed. GS2F2App was evaluated in two indoor scene benchmark datasets: the SUN RGB-D and the NYU Depth V2, achieving state-of-the-art results on both datasets.
引用
收藏
页码:24 / 30
页数:7
相关论文
共 34 条
  • [1] A Novel Statistical Method for Scene Classification Based on Multi-Object Categorization and Logistic Regression
    Ahmed, Abrar
    Jalal, Ahmad
    Kim, Kibum
    [J]. SENSORS, 2020, 20 (14) : 1 - 20
  • [2] Ayub A., 2020, BRIT MACHINE VISION
  • [3] When CNNs meet random RNNs: Towards multi-level analysis for RGB-D object and scene recognition
    Caglayan, Ali
    Imamoglu, Nevrez
    Can, Ahmet Burak
    Nakamura, Ryosuke
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 217
  • [4] Chen L. C., 2018, EUROPEAN C COMPUTER
  • [5] Scene recognition with objectness
    Cheng, Xiaojuan
    Lu, Jiwen
    Feng, Jianjiang
    Yuan, Bo
    Zhou, Jie
    [J]. PATTERN RECOGNITION, 2018, 74 : 474 - 487
  • [6] Du DP, 2018, IEEE INT CON MULTI
  • [7] Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images
    Gupta, Saurabh
    Arbelaez, Pablo
    Malik, Jitendra
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 564 - 571
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Herranz-Perdiguero C, 2018, IEEE INT C INT ROBOT, P355, DOI 10.1109/IROS.2018.8593736
  • [10] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269