Enhancing RGB-D Mirror Segmentation With a Neighborhood-Matching and Demand-Modal Adaptive Network Using Knowledge Distillation

被引:0
作者
Zhou, Wujie [1 ]
Zhang, Han [1 ]
Liu, Yuanyuan [2 ,3 ]
Luo, Ting [4 ]
机构
[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[2] China Univ Geosci, Sch Comp & Technol, Wuhan 430074, Peoples R China
[3] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore 308232, Singapore
[4] Ningbo Univ, Coll Sci & Technol, Ningbo 315300, Peoples R China
基金
中国国家自然科学基金;
关键词
Mirrors; Computational modeling; Computer vision; Semantic segmentation; Complexity theory; Image segmentation; Semantics; Knowledge transfer; Adaptation models; Noise; Mirror segmentation; knowledge distillation; sample complexity rater; multilevel distillation; SALIENT OBJECT DETECTION; SEMANTIC SEGMENTATION;
D O I
10.1109/TASE.2025.3547613
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent breakthroughs in computer vision have led to remarkable progress in the areas of autonomous vehicles and robotics. However, ordinary objects such as mirrors pose unique challenges to computer vision systems owing to occlusion, reflection, and distortion. Moreover, existing deep learning models suffer from issues such as excessive parameters and high computational complexity, making it challenging to implement numerous studies offline. To address these issues, we propose an innovative solution: a neighborhood-matching and demand-modal adaptive network using knowledge distillation (KD), called NDANet-S*, specifically designed for red-green-blue depth mirror segmentation. NDANet-S* operates by iteratively matching detailed and semantic difference between neighborhood features during the encoding phase. It then complements information across different modalities through demand-modal adaptation, enhancing heteromodal cross-complementation during the KD stage. In the decoding phase, semantic enhancement features and iterative encoding features are deeply integrated, forming a strong foundation for multistage progressive knowledge transfer in the KD process. Furthermore, we introduce a multistage teacher-assisted KD scheme, guided by sample complexity, to work synergistically with the mirror segmentation model. This innovative scheme includes a sample complexity rater, heterogeneous cross-complementarity, and hierarchical progressive knowledge transfer. Experimental evaluations on publicly available datasets indicate that NDANet-S* significantly enhances segmentation accuracy while preserving a consistent number of parameters. Additionally, it achieves state-of-the-art performance in mirror segmentation. The source code for our model is publicly available and can be accessed at: https://github.com/2021nihao/NMDANet.
引用
收藏
页码:12679 / 12692
页数:14
相关论文
共 76 条
  • [41] Learning Deep Representations with Probabilistic Knowledge Transfer
    Passalis, Nikolaos
    Tefas, Anastasios
    [J]. COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 283 - 299
  • [42] FedBKD: Heterogenous Federated Learning via Bidirectional Knowledge Distillation for Modulation Classification in IoT-Edge System
    Qi, Peihan
    Zhou, Xiaoyu
    Ding, Yuanlei
    Zhang, Zhengyu
    Zheng, Shilian
    Li, Zan
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2023, 17 (01) : 189 - 204
  • [43] U2-Net: Going deeper with nested U-structure for salient object detection
    Qin, Xuebin
    Zhang, Zichen
    Huang, Chenyang
    Dehghan, Masood
    Zaiane, Osmar R.
    Jagersand, Martin
    [J]. PATTERN RECOGNITION, 2020, 106 (106)
  • [44] BASNet: Boundary-Aware Salient Object Detection
    Qin, Xuebin
    Zhang, Zichen
    Huang, Chenyang
    Gao, Chao
    Dehghan, Masood
    Jagersand, Martin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7471 - 7481
  • [45] Raid A., 2014, International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), V4, P9, DOI DOI 10.5121/IJCSEIT.2014.4302
  • [46] Romero A, 2015, Arxiv, DOI arXiv:1412.6550
  • [47] Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments
    Seichter, Daniel
    Fischedick, Soehnke Benedikt
    Koehler, Mona
    Gross, Horst-Michael
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [48] Explainable scale distillation for hyperspectral image classification
    Shi, Cheng
    Fang, Li
    Lv, Zhiyong
    Zhao, Minghua
    [J]. PATTERN RECOGNITION, 2022, 122
  • [49] Song SR, 2015, PROC CVPR IEEE, P567, DOI 10.1109/CVPR.2015.7298655
  • [50] Degradation Aware Approach to Image Restoration Using Knowledge Distillation
    Suin, Maitreya
    Purohit, Kuldeep
    Rajagopalan, A. N.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 162 - 173