Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks

被引:10
作者
Barchid, Sami [1 ]
Mennesson, Jose [1 ,2 ]
Djeraba, Chaabane [1 ]
机构
[1] Univ Lille, CNRS, Cent Lille, UMR 9189 CRIStAL, F-59000 Lille, France
[2] IMT Lille Douai, Inst Mines Telecom, Ctr Digital Syst, Douai, France
来源
2021 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI) | 2021年
关键词
RGB-D Indoor Semantic Segmentation; Deep Convolutional Neural Networks; Deep Learning;
D O I
10.1109/CBMI50038.2021.9461875
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many research works focus on leveraging the complementary geometric information of indoor depth sensors in vision tasks performed by deep convolutional neural networks, notably semantic segmentation. These works deal with a specific vision task known as "RGB-D Indoor Semantic Segmentation". The challenges and resulting solutions of this task differ from its standard RGB counterpart. This results in a new active research topic. The objective of this paper is to introduce the field of Deep Convolutional Neural Networks for RGB-D Indoor Semantic Segmentation. This review presents the most popular public datasets, proposes a categorization of the strategies employed by recent contributions, evaluates the performance of the current state-of-the-art, and discusses the remaining challenges and promising directions for future works.
引用
收藏
页码:199 / 202
页数:4
相关论文
共 26 条
[1]  
[Anonymous], 2016, CoRR. abs/1511.07122
[2]  
Armeni I., 2017, ARXIV
[3]   Matterport3D: Learning from RGB-D Data in Indoor Environments [J].
Chang, Angel ;
Dai, Angela ;
Funkhouser, Thomas ;
Halber, Maciej ;
Niessner, Matthias ;
Savva, Manolis ;
Song, Shuran ;
Zeng, Andy ;
Zhang, Yinda .
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, :667-676
[4]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[5]   Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation [J].
Chen, Xiaokang ;
Lin, Kwan-Yee ;
Wang, Jingbo ;
Wu, Wayne ;
Qian, Chen ;
Li, Hongsheng ;
Zeng, Gang .
COMPUTER VISION - ECCV 2020, PT XI, 2020, 12356 :561-577
[6]   3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation [J].
Chen, Yunlu ;
Mensink, Thomas ;
Gavves, Efstratios .
2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, :173-182
[7]   A survey on indoor RGB-D semantic segmentation: from hand-crafted features to deep convolutional neural networks [J].
Fooladgar, Fahimeh ;
Kasaei, Shohreh .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (7-8) :4499-4524
[8]   Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images [J].
Gupta, Saurabh ;
Arbelaez, Pablo ;
Malik, Jitendra .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :564-571
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
Hu XX, 2019, IEEE IMAGE PROC, P1440, DOI [10.1109/icip.2019.8803025, 10.1109/ICIP.2019.8803025]