Learning deep cross-scale feature propagation for indoor semantic segmentation

Cited by: 6
Authors
Huan, Linxi [1]
Zheng, Xianwei [1]
Tang, Shengjun [2]
Gong, Jianya [1,3]
Affiliations
[1] Wuhan Univ, State Key Lab LIESMARS, Wuhan, Peoples R China
[2] Shenzhen Univ, Sch Architecture & Urban Planning, Shenzhen, Peoples R China
[3] Wuhan Univ, Sch Remote Sensing & Engn, Wuhan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Indoor scene parsing; Semantic segmentation; Deep learning; Cross-scale feature propagation; IMAGE; CLASSIFICATION
DOI
10.1016/j.isprsjprs.2021.03.023
Chinese Library Classification number
P9 [Physical geography]
Discipline classification code
0705; 070501
Abstract
Indoor semantic segmentation is a long-standing vision task that has recently been advanced by convolutional neural networks (CNNs), but the task remains challenging because of the heavy occlusion and large scale variation in indoor scenes. Existing CNN-based methods mainly focus on using auxiliary depth data to enrich features extracted from RGB images and therefore pay less attention to exploiting the multi-scale information in the extracted features, which is essential for distinguishing objects in highly cluttered indoor scenes. This paper proposes a deep cross-scale feature propagation network (CSNet) to effectively learn and fuse multi-scale features for robust semantic segmentation of indoor scene images. The proposed CSNet is deployed as an encoder-decoder engine. During encoding, CSNet propagates contextual information across scales and learns discriminative multi-scale features that are robust to large object scale variation and indoor occlusion. The decoder of CSNet then adaptively integrates the multi-scale encoded features, with fusion supervision at all scales, to generate the target semantic segmentation prediction. Extensive experiments conducted on two challenging benchmarks demonstrate that CSNet can effectively learn multi-scale representations for robust indoor semantic segmentation, achieving outstanding performance with mIoU scores of 51.5 and 50.8 on the NYUDv2 and SUN-RGBD datasets, respectively.
Pages: 42-53
Number of pages: 12