FIMKD: Feature-Implicit Mapping Knowledge Distillation for RGB-D Indoor Scene Semantic Segmentation

被引:3
|
作者
Zhou, Wujie [1 ]
Xiao, Yuxiang [1 ]
Liu, Yuanyuan [2 ,3 ]
Jiang, Qiuping [4 ]
机构
[1] Zhejiang University of Science and Technology, School of Information and Electronic Engineering, Hangzhou
[2] Nanyang Technological University, School of Computer Science and Engineering, Singapore
[3] China University of Geosciences, School of Computer and Technology, Wuhan
[4] Ningbo University, School of Information Science and Engineering, Ningbo
来源
IEEE Transactions on Artificial Intelligence | 2024年 / 5卷 / 12期
基金
中国国家自然科学基金;
关键词
depth enhancement mask mapping; Knowledge distillation; lightweight model; red-green-blue-depth semantic segmentation;
D O I
10.1109/TAI.2024.3452052
中图分类号
学科分类号
摘要
Depth images are often used to improve the geometric understanding of scenes owing to their intuitive distance properties. Although there have been significant advancements in semantic segmentation tasks using red-green-blue-depth (RGB-D) images, the complexity of existing methods remains high. Furthermore, the requirement for high-quality depth images increases the model inference time, which limits the practicality of these methods. To address this issue, we propose a feature-implicit mapping knowledge distillation (FIMKD) method and a cross-modal knowledge distillation (KD) architecture to leverage deep modal information for training and reduce the model dependence on this information during inference. The approach comprises two networks: FIMKD-T, a teacher network that uses RGB-D data, and FIMKD-S, a student network that uses only RGB data. FIMKD-T extracts high-frequency information using the depth modality and compensates for the loss of RGB details due to a reduction in resolution during feature extraction by the high-frequency feature enhancement module, thereby enhancing the geometric perception of semantic features. In contrast, the FIMKD-S network does not employ deep learning techniques; instead, it uses a non-learning approach to extract high-frequency information. To enable the FIMKD-S network to learn deep features, we propose a feature-implicit mapping KD for feature distillation. This mapping technique maps the features in channel and space to a low-dimensional hidden layer, which helps to avoid inefficient single-pattern student learning. We evaluated the proposed FIMKD-S∗ (FIMKD-S with KD) on the NYUv2 and SUN-RGBD datasets. The results demonstrate that both FIMKD-T and FIMKD-S∗ achieve state-of-the-art performance. Furthermore, FIMKD-S∗ provides the best performance balance. The code for this work is available at https://github.com/SHARKALAKALA/FIMKD. © 2020 IEEE.
引用
收藏
页码:6488 / 6499
相关论文
共 18 条
  • [1] DEPTH REMOVAL DISTILLATION FOR RGB-D SEMANTIC SEGMENTATION
    Fang, Tiyu
    Liang, Zhen
    Shao, Xiuli
    Dong, Zihao
    Li, Jinping
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2405 - 2409
  • [2] Lightweight Dual Stream Network With Knowledge Distillation for RGB-D Scene Parsing
    Zhang, Yuming
    Zhou, Wujie
    Ran, Xiaoxiao
    Fang, Meixin
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 855 - 859
  • [3] DGPINet-KD: Deep Guided and Progressive Integration Network With Knowledge Distillation for RGB-D Indoor Scene Analysis
    Zhou, Wujie
    Jian, Bitao
    Fang, Meixin
    Dong, Xiena
    Liu, Yuanyuan
    Jiang, Qiuping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 7844 - 7855
  • [4] Morphology-Guided Network via Knowledge Distillation for RGB-D Mirror Segmentation
    Zhou, Wujie
    Cai, Yuqi
    Qiang, Fangfang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17382 - 17391
  • [5] Contrastive learning-based knowledge distillation for RGB-thermal urban scene semantic segmentation
    Guo, Xiaodong
    Zhou, Wujie
    Liu, Tong
    KNOWLEDGE-BASED SYSTEMS, 2024, 292
  • [6] FCKDNet: A Feature Condensation Knowledge Distillation Network for Semantic Segmentation
    Yuan, Wenhao
    Lu, Xiaoyan
    Zhang, Rongfen
    Liu, Yuhong
    ENTROPY, 2023, 25 (01)
  • [7] ADRNet-S*: Asymmetric depth registration network via contrastive knowledge distillation for RGB-D mirror segmentation
    Zhou, Wujie
    Cai, Yuqi
    Dong, Xiena
    Qiang, Fangfang
    Qiu, Weiwei
    INFORMATION FUSION, 2024, 108
  • [8] FRKDNet:feature refine semantic segmentation network based on knowledge distillation
    Jiang Shi-yi
    Xu Yang
    Li Dan-yang
    Fan Run-ze
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (11) : 1590 - 1599
  • [9] HDBFormer: Efficient RGB-D Semantic Segmentation With a Heterogeneous Dual-Branch Framework
    Wei, Shuobin
    Zhou, Zhuang
    Lu, Zhengan
    Yuan, Zizhao
    Su, Binghua
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 91 - 95
  • [10] Enhancing RGB-D Mirror Segmentation With a Neighborhood-Matching and Demand-Modal Adaptive Network Using Knowledge Distillation
    Zhou, Wujie
    Zhang, Han
    Liu, Yuanyuan
    Luo, Ting
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 12679 - 12692