DGPINet-KD: Deep Guided and Progressive Integration Network With Knowledge Distillation for RGB-D Indoor Scene Analysis

被引:14
|
作者
Zhou, Wujie [1 ,2 ]
Jian, Bitao [1 ]
Fang, Meixin [1 ]
Dong, Xiena [1 ]
Liu, Yuanyuan [2 ,3 ]
Jiang, Qiuping [4 ]
机构
[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 308232, Singapore
[3] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[4] Ningbo Univ, Sch Informat Sci & Engn, Ningbo 315211, Peoples R China
基金
中国国家自然科学基金;
关键词
RGB-D data; indoor scene analysis; knowledge distillation; branch attention; depth guidance; SEGMENTATION;
D O I
10.1109/TCSVT.2024.3382354
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Significant advancements in RGB-D semantic segmentation have been made owing to the increasing availability of robust depth information. Most researchers have combined depth with RGB data to capture complementary information in images. Although this approach improves segmentation performance, it requires excessive model parameters. To address this problem, we propose DGPINet-KD, a deep-guided and progressive integration network with knowledge distillation (KD) for RGB-D indoor scene analysis. First, we used branching attention and depth guidance to capture coordinated, precise location information and extract more complete spatial information from the depth map to complement the semantic information for the encoded features. Second, we trained the student network (DGPINet-S) with a well-trained teacher network (DGPINet-T) using a multilevel KD. Third, an integration unit was developed to explore the contextual dependencies of the decoding features and to enhance relational KD. Comprehensive experiments on two challenging indoor benchmark datasets, NYUDv2 and SUN RGB-D, demonstrated that DGPINet-KD achieved improved performance in indoor scene analysis tasks compared with existing methods. Notably, on the NYUDv2 dataset, DGPINet-KD (DGPINet-S with KD) achieves a pixel accuracy gain of 1.7% and a class accuracy gain of 2.3% compared with DGPINet-S. In addition, compared with DGPINet-T, the proposed DGPINet-KD (DGPINet-S with KD) utilizes significantly fewer parameters (29.3M) while maintaining accuracy. The source code is available at https://github.com/XUEXIKUAIL/DGPINet.
引用
收藏
页码:7844 / 7855
页数:12
相关论文
共 8 条
  • [1] Lightweight Dual Stream Network With Knowledge Distillation for RGB-D Scene Parsing
    Zhang, Yuming
    Zhou, Wujie
    Ran, Xiaoxiao
    Fang, Meixin
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 855 - 859
  • [2] FIMKD: Feature-Implicit Mapping Knowledge Distillation for RGB-D Indoor Scene Semantic Segmentation
    Zhou, Wujie
    Xiao, Yuxiang
    Liu, Yuanyuan
    Jiang, Qiuping
    IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 6488 - 6499
  • [3] Morphology-Guided Network via Knowledge Distillation for RGB-D Mirror Segmentation
    Zhou, Wujie
    Cai, Yuqi
    Qiang, Fangfang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17382 - 17391
  • [4] ADRNet-S*: Asymmetric depth registration network via contrastive knowledge distillation for RGB-D mirror segmentation
    Zhou, Wujie
    Cai, Yuqi
    Dong, Xiena
    Qiang, Fangfang
    Qiu, Weiwei
    INFORMATION FUSION, 2024, 108
  • [5] LCH: fast RGB-D salient object detection on CPU via lightweight convolutional network with hybrid knowledge distillation
    Binglu Wang
    Fan Zhang
    Yongqiang Zhao
    The Visual Computer, 2024, 40 : 1997 - 2014
  • [6] LCH: fast RGB-D salient object detection on CPU via lightweight convolutional network with hybrid knowledge distillation
    Wang, Binglu
    Zhang, Fan
    Zhao, Yongqiang
    VISUAL COMPUTER, 2024, 40 (03) : 1997 - 2014
  • [7] Enhancing RGB-D Mirror Segmentation With a Neighborhood-Matching and Demand-Modal Adaptive Network Using Knowledge Distillation
    Zhou, Wujie
    Zhang, Han
    Liu, Yuanyuan
    Luo, Ting
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 12679 - 12692
  • [8] Progressive Guided Fusion Network With Multi-Modal and Multi-Scale Attention for RGB-D Salient Object Detection
    Wu, Jiajia
    Han, Guangliang
    Wang, Haining
    Yang, Hang
    Li, Qingqing
    Liu, Dongxu
    Ye, Fangjian
    Liu, Peixun
    IEEE ACCESS, 2021, 9 : 150608 - 150622