RTLNet: Recursive Triple-Path Learning Network for Scene Parsing of RGB-D Images

Cited by: 4
Authors
Yue, Yuchun [1]
Zhou, Wujie [1]
Lei, Jingsheng [1]
Yu, Lu [2]
Affiliations
[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[2] Zhejiang Univ, Coll Informat & Elect Engn, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Image segmentation; Semantics; Decoding; Training; Streaming media; Sensors; Feature extraction; Scene parsing; cross-modality fusion; multiscale feature fusion; recursive learning; deep learning;
DOI
10.1109/LSP.2021.3139567
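Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification codes
0808; 0809
Abstract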
Scene parsing approaches have attracted extensive attention in recent years; although several methods have been developed for scene parsing, most include complex modules for both cross-modality fusion between RGB and depth images in the encoder and image scale level recovery in the decoder under label supervision for high inference accuracy. Cross-modality information in the encoder may be diluted when processed through the decoder, and the supervision results may not be reused effectively, which adversely affects scene parsing. To address these problems, we propose a recursive triple-path learning network (RTLNet) for cross-modality interactions in the decoder using global context and cross-modality fusion modules. The proposed modules fully use cross-modality information to reduce information loss. To enhance the robustness of RTLNet, we add a path to reuse the initial predictions from the decoder and introduce a ladder-shaped feature consistency module to further leverage multiscale features. Experiments are conducted with the proposed RTLNet and nine recent RGB-D indoor scene parsing methods on the NYUv2 and SUN-RGBD indoor scene datasets; the results show that the RTLNet outperforms the other methods.
Pages: 429-433
Number of pages: 5