Learning deep cross-scale feature propagation for indoor semantic segmentation

被引：6

作者：

Huan, Linxi ^{[1
]}

Zheng, Xianwei ^{[1
]}

Tang, Shengjun ^{[2
]}

Gong, Jianya ^{[1
,3
]}

机构：

[1] Wuhan Univ, State Key Lab LIESMARS, Wuhan, Peoples R China

[2] Shenzhen Univ, Sch Architecture & Urban Planning, Shenzhen, Peoples R China

[3] Wuhan Univ, Sch Remote Sensing & Engn, Wuhan, Peoples R China

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2021年 / 176卷

基金：

中国国家自然科学基金;

关键词：

Indoor scene parsing; Semantic segmentation; Deep learning; Cross-scale feature propagation; IMAGE; CLASSIFICATION;

D O I：

10.1016/j.isprsjprs.2021.03.023

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Indoor semantic segmentation is a long-standing vision task that has been recently advanced by convolutional neural networks (CNNs), but this task remains challenging by high occlusion and large scale variation of indoor scenes. Existing CNN-based methods mainly focus on using auxiliary depth data to enrich features extracted from RGB images, hence, they pay less attention to exploiting multi-scale information in exracted features, which is essential for distinguishing objects in highly cluttered indoor scenes. This paper proposes a deep cross-scale feature propagation network (CSNet), to effectively learn and fuse multi-scale features for robust semantic segmentation of indoor scene images. The proposed CSNet is deployed as an encoder-decoder engine. During encoding, the CSNet propagates contextual information across scales and learn discriminative multi-scale features, which are robust to large object scale variation and indoor occlusion. The decoder of CSNet then adaptively integrates the multi-scale encoded features with fusion supervision at all scales to generate target semantic segmentation prediction. Extensive experiments conducted on two challenging benchmarks demonstrate that the CSNet can effectively learn multi-scale representations for robust indoor semantic segmentation, achieving outstanding performance with mIoU scores of 51.5 and 50.8 on NYUDv2 and SUN-RGBD datasets, respectively.

引用

页码：42 / 53

页数：12

共 50 条

[41] Deep learning based semantic segmentation approach for automatic detection of brain tumor
Markkandeyan, S.
Gupta, Shivani
Narayanan, G. Venkat
Reddy, M. Jithender
Al-Khasawneh, Mahmoud Ahmad
Ishrat, Mohammad
Kiran, Ajmeera
INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2023, 18 (04)
[42] Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning
Kemker, Ronald
Salvaggio, Carl
Kanan, Christopher
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 : 60 - 77
[43] MULTICLASS SEMANTIC SEGMENTATION FOR DIGITISATION OF MOVABLE HERITAGE USING DEEP LEARNING TECHNIQUES
Patrucco, Giacomo
Setragno, Francesco
VIRTUAL ARCHAEOLOGY REVIEW, 2021, 12 (25): : 85 - 98
[44] Deep-Learning-Based Semantic Segmentation of Remote Sensing Images: A Survey
Huang, Liwei
Jiang, Bitao
Lv, Shouye
Liu, Yanbo
Fu, Ying
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 8370 - 8396
[45] Point Cloud Semantic Segmentation Using a Deep Learning Framework for Cultural Heritage
Pierdicca, Roberto
Paolanti, Marina
Matrone, Francesca
Martini, Massimo
Morbidoni, Christian
Malinverni, Eva Savina
Frontoni, Emanuele
Lingua, Andrea Maria
REMOTE SENSING, 2020, 12 (06)
[46] A deep learning-based and adaptive region proposal algorithm for semantic segmentation
Taghizadeh, Maryam
Chalechale, Abdolah
APPLIED SOFT COMPUTING, 2024, 155
[47] Saliency detection via cross-scale deep inference*
Ren, Dakai
Wen, Xiangming
Jia, Tao
Chen, Jiazhong
Li, Zongyi
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 75
[48] Multi-scale Adaptive Feature Fusion Network for Semantic Segmentation in Remote Sensing Images
Shang, Ronghua
Zhang, Jiyu
Jiao, Licheng
Li, Yangyang
Marturi, Naresh
Stolkin, Rustam
REMOTE SENSING, 2020, 12 (05)
[49] A Deep Learning Multiview Approach for the Semantic Segmentation of Heritage Building Point Clouds
Pellis, Eugenio
Masiero, Andrea
Betti, Michele
Tucci, Grazia
Grussenmeyer, Pierre
INTERNATIONAL JOURNAL OF ARCHITECTURAL HERITAGE, 2025,
[50] Deep Near Infrared Colorization with Semantic Segmentation and Transfer Learning
Wang, Fengqiao
Liu, Lu
Jung, Cheolkon
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 455 - 458

← 1 2 3 4 5 →