Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks

被引：10

作者：

Barchid, Sami ^{[1
]}

Mennesson, Jose ^{[1
,2
]}

Djeraba, Chaabane ^{[1
]}

机构：

[1] Univ Lille, CNRS, Cent Lille, UMR 9189 CRIStAL, F-59000 Lille, France

[2] IMT Lille Douai, Inst Mines Telecom, Ctr Digital Syst, Douai, France

来源：

2021 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI) | 2021年

关键词：

RGB-D Indoor Semantic Segmentation; Deep Convolutional Neural Networks; Deep Learning;

D O I：

10.1109/CBMI50038.2021.9461875

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many research works focus on leveraging the complementary geometric information of indoor depth sensors in vision tasks performed by deep convolutional neural networks, notably semantic segmentation. These works deal with a specific vision task known as "RGB-D Indoor Semantic Segmentation". The challenges and resulting solutions of this task differ from its standard RGB counterpart. This results in a new active research topic. The objective of this paper is to introduce the field of Deep Convolutional Neural Networks for RGB-D Indoor Semantic Segmentation. This review presents the most popular public datasets, proposes a categorization of the strategies employed by recent contributions, evaluates the performance of the current state-of-the-art, and discusses the remaining challenges and promising directions for future works.

引用

页码：199 / 202

页数：4

共 26 条

[1]

[Anonymous], 2016, CoRR. abs/1511.07122

[2]

Armeni I., 2017, ARXIV

[3] Matterport3D: Learning from RGB-D Data in Indoor Environments [J].

Chang, Angel ;

Dai, Angela ;

Funkhouser, Thomas ;

Halber, Maciej ;

Niessner, Matthias ;

Savva, Manolis ;

Song, Shuran ;

Zeng, Andy ;

Zhang, Yinda .

PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, :667-676

[4] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[5] Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation [J].

Chen, Xiaokang ;

Lin, Kwan-Yee ;

Wang, Jingbo ;

Wu, Wayne ;

Qian, Chen ;

Li, Hongsheng ;

Zeng, Gang .

COMPUTER VISION - ECCV 2020, PT XI, 2020, 12356 :561-577

[6] 3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation [J].

Chen, Yunlu ;

Mensink, Thomas ;

Gavves, Efstratios .

2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, :173-182

[7] A survey on indoor RGB-D semantic segmentation: from hand-crafted features to deep convolutional neural networks [J].

Fooladgar, Fahimeh ;

Kasaei, Shohreh .

MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (7-8) :4499-4524

[8] Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images [J].

Gupta, Saurabh ;

Arbelaez, Pablo ;

Malik, Jitendra .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :564-571

[9] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[10]

Hu XX, 2019, IEEE IMAGE PROC, P1440, DOI [10.1109/icip.2019.8803025, 10.1109/ICIP.2019.8803025]

← 1 2 3 →