Learning deep cross-scale feature propagation for indoor semantic segmentation

被引:6
|
作者
Huan, Linxi [1 ]
Zheng, Xianwei [1 ]
Tang, Shengjun [2 ]
Gong, Jianya [1 ,3 ]
机构
[1] Wuhan Univ, State Key Lab LIESMARS, Wuhan, Peoples R China
[2] Shenzhen Univ, Sch Architecture & Urban Planning, Shenzhen, Peoples R China
[3] Wuhan Univ, Sch Remote Sensing & Engn, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Indoor scene parsing; Semantic segmentation; Deep learning; Cross-scale feature propagation; IMAGE; CLASSIFICATION;
D O I
10.1016/j.isprsjprs.2021.03.023
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Indoor semantic segmentation is a long-standing vision task that has been recently advanced by convolutional neural networks (CNNs), but this task remains challenging by high occlusion and large scale variation of indoor scenes. Existing CNN-based methods mainly focus on using auxiliary depth data to enrich features extracted from RGB images, hence, they pay less attention to exploiting multi-scale information in exracted features, which is essential for distinguishing objects in highly cluttered indoor scenes. This paper proposes a deep cross-scale feature propagation network (CSNet), to effectively learn and fuse multi-scale features for robust semantic segmentation of indoor scene images. The proposed CSNet is deployed as an encoder-decoder engine. During encoding, the CSNet propagates contextual information across scales and learn discriminative multi-scale features, which are robust to large object scale variation and indoor occlusion. The decoder of CSNet then adaptively integrates the multi-scale encoded features with fusion supervision at all scales to generate target semantic segmentation prediction. Extensive experiments conducted on two challenging benchmarks demonstrate that the CSNet can effectively learn multi-scale representations for robust indoor semantic segmentation, achieving outstanding performance with mIoU scores of 51.5 and 50.8 on NYUDv2 and SUN-RGBD datasets, respectively.
引用
收藏
页码:42 / 53
页数:12
相关论文
共 50 条
  • [31] Review of Deep Learning-Based Semantic Segmentation
    Zhang Xiangfu
    Jian, Liu
    Shi Zhangsong
    Wu Zhonghong
    Zhi, Wang
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (15)
  • [32] C3Net: Cross-Modal Feature Recalibrated, Cross-Scale Semantic Aggregated and Compact Network for Semantic Segmentation of Multi-Modal High-Resolution Aerial Images
    Cao, Zhiying
    Diao, Wenhui
    Sun, Xian
    Lyu, Xiaode
    Yan, Menglong
    Fu, Kun
    REMOTE SENSING, 2021, 13 (03)
  • [33] Review of Semantic Segmentation by Using Deep learning methods
    Rajeswari, B.
    Ram, J. Mani
    Kumar, D. V. T. Praveen
    Harshith, K. L. V. V.
    2024 INTERNATIONAL CONFERENCE ON SOCIAL AND SUSTAINABLE INNOVATIONS IN TECHNOLOGY AND ENGINEERING, SASI-ITE 2024, 2024, : 272 - 277
  • [34] Discriminative Feature Learning for Video Semantic Segmentation
    Zhang, Han
    Jiang, Kai
    Zhang, Yu
    Li, Qing
    Xia, Changqun
    Chen, Xiaowu
    2014 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV2014), 2014, : 321 - 326
  • [35] Semantic image segmentation network based on deep learning
    Chen, Bo
    Zhang, Jiahao
    Zhou, Jianbang
    Chen, Zhong
    Yang, Tian
    Zhang, Yanna
    MIPPR 2019: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2020, 11429
  • [36] PCSformer: Pair-wise Cross-scale Sub-prototypes mining with CNN-transformers for weakly supervised semantic segmentation
    Liu, Chunmeng
    Shen, Yao
    Xiao, Qingguo
    Li, Guangyao
    NEUROCOMPUTING, 2024, 593
  • [37] Deep learning-based semantic segmentation of three-dimensional point cloud: a comprehensive review
    Singh, Dheerendra Pratap
    Yadav, Manohar
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (02) : 532 - 586
  • [38] CFNet: Cross-scale fusion network for medical image segmentation
    Benabid, Amina
    Yuan, Jing
    Elhassan, Mohmmed A. M.
    Benabid, Douaa
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (07)
  • [39] Deep learning-based hybrid feature selection for the semantic segmentation of crops and weeds
    Janneh, Lamin L.
    Youngjun, Zhang
    Hydara, Mbemba
    Cui, Zhongwei
    ICT EXPRESS, 2024, 10 (01): : 118 - 124
  • [40] Cross-scale informative priors network for medical image segmentation
    Sui, Fuxian
    Wang, Hua
    Zhang, Fan
    DIGITAL SIGNAL PROCESSING, 2025, 157