FIMKD: Feature-Implicit Mapping Knowledge Distillation for RGB-D Indoor Scene Semantic Segmentation

Cited by: 3

Authors:

Zhou, Wujie [1]

Xiao, Yuxiang [1]

Liu, Yuanyuan [2, 3]

Jiang, Qiuping [4]

Affiliations:

[1] Zhejiang University of Science and Technology, School of Information and Electronic Engineering, Hangzhou

[2] Nanyang Technological University, School of Computer Science and Engineering, Singapore

[3] China University of Geosciences, School of Computer and Technology, Wuhan

[4] Ningbo University, School of Information Science and Engineering, Ningbo

Source:

IEEE Transactions on Artificial Intelligence | 2024 / Vol. 5 / No. 12

Funding:

National Natural Science Foundation of China

Keywords:

depth enhancement mask mapping; knowledge distillation; lightweight model; red-green-blue-depth semantic segmentation

DOI:

10.1109/TAI.2024.3452052

Abstract:

Depth images are often used to improve the geometric understanding of scenes owing to their intuitive distance properties. Although there have been significant advancements in semantic segmentation tasks using red-green-blue-depth (RGB-D) images, the complexity of existing methods remains high. Furthermore, the requirement for high-quality depth images increases the model inference time, which limits the practicality of these methods. To address this issue, we propose a feature-implicit mapping knowledge distillation (FIMKD) method and a cross-modal knowledge distillation (KD) architecture to leverage deep modal information for training and reduce the model dependence on this information during inference. The approach comprises two networks: FIMKD-T, a teacher network that uses RGB-D data, and FIMKD-S, a student network that uses only RGB data. FIMKD-T extracts high-frequency information using the depth modality and compensates for the loss of RGB details due to a reduction in resolution during feature extraction by the high-frequency feature enhancement module, thereby enhancing the geometric perception of semantic features. In contrast, the FIMKD-S network does not employ deep learning techniques; instead, it uses a non-learning approach to extract high-frequency information. To enable the FIMKD-S network to learn deep features, we propose a feature-implicit mapping KD for feature distillation. This mapping technique maps the features in channel and space to a low-dimensional hidden layer, which helps to avoid inefficient single-pattern student learning. We evaluated the proposed FIMKD-S∗ (FIMKD-S with KD) on the NYUv2 and SUN-RGBD datasets. The results demonstrate that both FIMKD-T and FIMKD-S∗ achieve state-of-the-art performance. Furthermore, FIMKD-S∗ provides the best performance balance. The code for this work is available at https://github.com/SHARKALAKALA/FIMKD. © 2020 IEEE.
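The abstract describes the distillation step only at a high level: teacher and student features are mapped "in channel and space" to a low-dimensional hidden representation before they are compared. The following is a minimal sketch of that general idea, not the authors' implementation. The function names, the fixed random projections (standing in for whatever learned mapping the paper uses), the average pooling over positions, and the mean-squared-error loss are all assumptions made for illustration.

```python
import random


def channel_map(feat, proj):
    """Project C channels to d hidden channels at every position.

    feat: C lists of N values (flattened spatial positions).
    proj: d lists of C weights (hypothetical stand-in for a learned mapping).
    """
    n = len(feat[0])
    return [
        [sum(w * feat[c][p] for c, w in enumerate(row)) for p in range(n)]
        for row in proj
    ]


def spatial_map(feat, factor):
    """Average-pool each channel over groups of `factor` consecutive positions."""
    return [
        [sum(ch[i:i + factor]) / factor
         for i in range(0, len(ch) - factor + 1, factor)]
        for ch in feat
    ]


def implicit_mapping_loss(teacher, student, proj_t, proj_s, factor):
    """Mean squared error between teacher and student hidden representations."""
    zt = spatial_map(channel_map(teacher, proj_t), factor)
    zs = spatial_map(channel_map(student, proj_s), factor)
    diffs = [(a - b) ** 2 for rt, rs in zip(zt, zs) for a, b in zip(rt, rs)]
    return sum(diffs) / len(diffs)


# Toy data: the teacher has 8 channels, the student 4, both over 16 positions.
rng = random.Random(0)
teacher = [[rng.random() for _ in range(16)] for _ in range(8)]
student = [[rng.random() for _ in range(16)] for _ in range(4)]
proj_t = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(2)]
proj_s = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(2)]
loss = implicit_mapping_loss(teacher, student, proj_t, proj_s, factor=4)
```

Because each network has its own projection into the shared hidden size, the teacher and student may have different channel counts, which is the usual situation when a lightweight RGB-only student is distilled from a larger RGB-D teacher.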

Pages: 6488-6499
