Learning deep cross-scale feature propagation for indoor semantic segmentation

被引：6

作者：

Huan, Linxi ^{[1
]}

Zheng, Xianwei ^{[1
]}

Tang, Shengjun ^{[2
]}

Gong, Jianya ^{[1
,3
]}

机构：

[1] Wuhan Univ, State Key Lab LIESMARS, Wuhan, Peoples R China

[2] Shenzhen Univ, Sch Architecture & Urban Planning, Shenzhen, Peoples R China

[3] Wuhan Univ, Sch Remote Sensing & Engn, Wuhan, Peoples R China

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2021年 / 176卷

基金：

中国国家自然科学基金;

关键词：

Indoor scene parsing; Semantic segmentation; Deep learning; Cross-scale feature propagation; IMAGE; CLASSIFICATION;

D O I：

10.1016/j.isprsjprs.2021.03.023

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Indoor semantic segmentation is a long-standing vision task that has been recently advanced by convolutional neural networks (CNNs), but this task remains challenging by high occlusion and large scale variation of indoor scenes. Existing CNN-based methods mainly focus on using auxiliary depth data to enrich features extracted from RGB images, hence, they pay less attention to exploiting multi-scale information in exracted features, which is essential for distinguishing objects in highly cluttered indoor scenes. This paper proposes a deep cross-scale feature propagation network (CSNet), to effectively learn and fuse multi-scale features for robust semantic segmentation of indoor scene images. The proposed CSNet is deployed as an encoder-decoder engine. During encoding, the CSNet propagates contextual information across scales and learn discriminative multi-scale features, which are robust to large object scale variation and indoor occlusion. The decoder of CSNet then adaptively integrates the multi-scale encoded features with fusion supervision at all scales to generate target semantic segmentation prediction. Extensive experiments conducted on two challenging benchmarks demonstrate that the CSNet can effectively learn multi-scale representations for robust indoor semantic segmentation, achieving outstanding performance with mIoU scores of 51.5 and 50.8 on NYUDv2 and SUN-RGBD datasets, respectively.

引用

页码：42 / 53

页数：12

共 50 条

[31] Review of Deep Learning-Based Semantic Segmentation
Zhang Xiangfu
Jian, Liu
Shi Zhangsong
Wu Zhonghong
Zhi, Wang
LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (15)
[32] C3Net: Cross-Modal Feature Recalibrated, Cross-Scale Semantic Aggregated and Compact Network for Semantic Segmentation of Multi-Modal High-Resolution Aerial Images
Cao, Zhiying
Diao, Wenhui
Sun, Xian
Lyu, Xiaode
Yan, Menglong
Fu, Kun
REMOTE SENSING, 2021, 13 (03)
[33] Review of Semantic Segmentation by Using Deep learning methods
Rajeswari, B.
Ram, J. Mani
Kumar, D. V. T. Praveen
Harshith, K. L. V. V.
2024 INTERNATIONAL CONFERENCE ON SOCIAL AND SUSTAINABLE INNOVATIONS IN TECHNOLOGY AND ENGINEERING, SASI-ITE 2024, 2024, : 272 - 277
[34] Discriminative Feature Learning for Video Semantic Segmentation
Zhang, Han
Jiang, Kai
Zhang, Yu
Li, Qing
Xia, Changqun
Chen, Xiaowu
2014 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV2014), 2014, : 321 - 326
[35] Semantic image segmentation network based on deep learning
Chen, Bo
Zhang, Jiahao
Zhou, Jianbang
Chen, Zhong
Yang, Tian
Zhang, Yanna
MIPPR 2019: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2020, 11429
[36] PCSformer: Pair-wise Cross-scale Sub-prototypes mining with CNN-transformers for weakly supervised semantic segmentation
Liu, Chunmeng
Shen, Yao
Xiao, Qingguo
Li, Guangyao
NEUROCOMPUTING, 2024, 593
[37] Deep learning-based semantic segmentation of three-dimensional point cloud: a comprehensive review
Singh, Dheerendra Pratap
Yadav, Manohar
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (02) : 532 - 586
[38] CFNet: Cross-scale fusion network for medical image segmentation
Benabid, Amina
Yuan, Jing
Elhassan, Mohmmed A. M.
Benabid, Douaa
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (07)
[39] Deep learning-based hybrid feature selection for the semantic segmentation of crops and weeds
Janneh, Lamin L.
Youngjun, Zhang
Hydara, Mbemba
Cui, Zhongwei
ICT EXPRESS, 2024, 10 (01): : 118 - 124
[40] Cross-scale informative priors network for medical image segmentation
Sui, Fuxian
Wang, Hua
Zhang, Fan
DIGITAL SIGNAL PROCESSING, 2025, 157

← 1 2 3 4 5 →