DGPINet-KD: Deep Guided and Progressive Integration Network With Knowledge Distillation for RGB-D Indoor Scene Analysis

被引：14

作者：

Zhou, Wujie ^{[1
,2
]}

Jian, Bitao ^{[1
]}

Fang, Meixin ^{[1
]}

Dong, Xiena ^{[1
]}

Liu, Yuanyuan ^{[2
,3
]}

Jiang, Qiuping ^{[4
]}

机构：

[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China

[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 308232, Singapore

[3] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China

[4] Ningbo Univ, Sch Informat Sci & Engn, Ningbo 315211, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 09期

基金：

中国国家自然科学基金;

关键词：

RGB-D data; indoor scene analysis; knowledge distillation; branch attention; depth guidance; SEGMENTATION;

D O I：

10.1109/TCSVT.2024.3382354

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Significant advancements in RGB-D semantic segmentation have been made owing to the increasing availability of robust depth information. Most researchers have combined depth with RGB data to capture complementary information in images. Although this approach improves segmentation performance, it requires excessive model parameters. To address this problem, we propose DGPINet-KD, a deep-guided and progressive integration network with knowledge distillation (KD) for RGB-D indoor scene analysis. First, we used branching attention and depth guidance to capture coordinated, precise location information and extract more complete spatial information from the depth map to complement the semantic information for the encoded features. Second, we trained the student network (DGPINet-S) with a well-trained teacher network (DGPINet-T) using a multilevel KD. Third, an integration unit was developed to explore the contextual dependencies of the decoding features and to enhance relational KD. Comprehensive experiments on two challenging indoor benchmark datasets, NYUDv2 and SUN RGB-D, demonstrated that DGPINet-KD achieved improved performance in indoor scene analysis tasks compared with existing methods. Notably, on the NYUDv2 dataset, DGPINet-KD (DGPINet-S with KD) achieves a pixel accuracy gain of 1.7% and a class accuracy gain of 2.3% compared with DGPINet-S. In addition, compared with DGPINet-T, the proposed DGPINet-KD (DGPINet-S with KD) utilizes significantly fewer parameters (29.3M) while maintaining accuracy. The source code is available at https://github.com/XUEXIKUAIL/DGPINet.

引用

页码：7844 / 7855

页数：12

共 8 条

[1] Lightweight Dual Stream Network With Knowledge Distillation for RGB-D Scene Parsing
Zhang, Yuming
Zhou, Wujie
Ran, Xiaoxiao
Fang, Meixin
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 855 - 859
[2] FIMKD: Feature-Implicit Mapping Knowledge Distillation for RGB-D Indoor Scene Semantic Segmentation
Zhou, Wujie
Xiao, Yuxiang
Liu, Yuanyuan
Jiang, Qiuping
IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 6488 - 6499
[3] Morphology-Guided Network via Knowledge Distillation for RGB-D Mirror Segmentation
Zhou, Wujie
Cai, Yuqi
Qiang, Fangfang
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17382 - 17391
[4] ADRNet-S*: Asymmetric depth registration network via contrastive knowledge distillation for RGB-D mirror segmentation
Zhou, Wujie
Cai, Yuqi
Dong, Xiena
Qiang, Fangfang
Qiu, Weiwei
INFORMATION FUSION, 2024, 108
[5] LCH: fast RGB-D salient object detection on CPU via lightweight convolutional network with hybrid knowledge distillation
Binglu Wang
Fan Zhang
Yongqiang Zhao
The Visual Computer, 2024, 40 : 1997 - 2014
[6] LCH: fast RGB-D salient object detection on CPU via lightweight convolutional network with hybrid knowledge distillation
Wang, Binglu
Zhang, Fan
Zhao, Yongqiang
VISUAL COMPUTER, 2024, 40 (03) : 1997 - 2014
[7] Enhancing RGB-D Mirror Segmentation With a Neighborhood-Matching and Demand-Modal Adaptive Network Using Knowledge Distillation
Zhou, Wujie
Zhang, Han
Liu, Yuanyuan
Luo, Ting
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 12679 - 12692
[8] Progressive Guided Fusion Network With Multi-Modal and Multi-Scale Attention for RGB-D Salient Object Detection
Wu, Jiajia
Han, Guangliang
Wang, Haining
Yang, Hang
Li, Qingqing
Liu, Dongxu
Ye, Fangjian
Liu, Peixun
IEEE ACCESS, 2021, 9 : 150608 - 150622

← 1 →