Enhancing RGB-D Mirror Segmentation With a Neighborhood-Matching and Demand-Modal Adaptive Network Using Knowledge Distillation

被引：0

作者：

Zhou, Wujie ^{[1
]}

Zhang, Han ^{[1
]}

Liu, Yuanyuan ^{[2
,3
]}

Luo, Ting ^{[4
]}

机构：

[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China

[2] China Univ Geosci, Sch Comp & Technol, Wuhan 430074, Peoples R China

[3] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore 308232, Singapore

[4] Ningbo Univ, Coll Sci & Technol, Ningbo 315300, Peoples R China

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2025年 / 22卷

基金：

中国国家自然科学基金;

关键词：

Mirrors; Computational modeling; Computer vision; Semantic segmentation; Complexity theory; Image segmentation; Semantics; Knowledge transfer; Adaptation models; Noise; Mirror segmentation; knowledge distillation; sample complexity rater; multilevel distillation; SALIENT OBJECT DETECTION; SEMANTIC SEGMENTATION;

D O I：

10.1109/TASE.2025.3547613

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recent breakthroughs in computer vision have led to remarkable progress in the areas of autonomous vehicles and robotics. However, ordinary objects such as mirrors pose unique challenges to computer vision systems owing to occlusion, reflection, and distortion. Moreover, existing deep learning models suffer from issues such as excessive parameters and high computational complexity, making it challenging to implement numerous studies offline. To address these issues, we propose an innovative solution: a neighborhood-matching and demand-modal adaptive network using knowledge distillation (KD), called NDANet-S*, specifically designed for red-green-blue depth mirror segmentation. NDANet-S* operates by iteratively matching detailed and semantic difference between neighborhood features during the encoding phase. It then complements information across different modalities through demand-modal adaptation, enhancing heteromodal cross-complementation during the KD stage. In the decoding phase, semantic enhancement features and iterative encoding features are deeply integrated, forming a strong foundation for multistage progressive knowledge transfer in the KD process. Furthermore, we introduce a multistage teacher-assisted KD scheme, guided by sample complexity, to work synergistically with the mirror segmentation model. This innovative scheme includes a sample complexity rater, heterogeneous cross-complementarity, and hierarchical progressive knowledge transfer. Experimental evaluations on publicly available datasets indicate that NDANet-S* significantly enhances segmentation accuracy while preserving a consistent number of parameters. Additionally, it achieves state-of-the-art performance in mirror segmentation. The source code for our model is publicly available and can be accessed at: https://github.com/2021nihao/NMDANet.

引用

页码：12679 / 12692

页数：14

共 76 条

[41] Learning Deep Representations with Probabilistic Knowledge Transfer
Passalis, Nikolaos
Tefas, Anastasios
[J]. COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 283 - 299
[42] FedBKD: Heterogenous Federated Learning via Bidirectional Knowledge Distillation for Modulation Classification in IoT-Edge System
Qi, Peihan
Zhou, Xiaoyu
Ding, Yuanlei
Zhang, Zhengyu
Zheng, Shilian
Li, Zan
[J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2023, 17 (01) : 189 - 204
[43] U2-Net: Going deeper with nested U-structure for salient object detection
Qin, Xuebin
Zhang, Zichen
Huang, Chenyang
Dehghan, Masood
Zaiane, Osmar R.
Jagersand, Martin
[J]. PATTERN RECOGNITION, 2020, 106 (106)
[44] BASNet: Boundary-Aware Salient Object Detection
Qin, Xuebin
Zhang, Zichen
Huang, Chenyang
Gao, Chao
Dehghan, Masood
Jagersand, Martin
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7471 - 7481
[45] Raid A., 2014, International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), V4, P9, DOI DOI 10.5121/IJCSEIT.2014.4302
[46] Romero A, 2015, Arxiv, DOI arXiv:1412.6550
[47] Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments
Seichter, Daniel
Fischedick, Soehnke Benedikt
Koehler, Mona
Gross, Horst-Michael
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[48] Explainable scale distillation for hyperspectral image classification
Shi, Cheng
Fang, Li
Lv, Zhiyong
Zhao, Minghua
[J]. PATTERN RECOGNITION, 2022, 122
[49] Song SR, 2015, PROC CVPR IEEE, P567, DOI 10.1109/CVPR.2015.7298655
[50] Degradation Aware Approach to Image Restoration Using Knowledge Distillation
Suin, Maitreya
Purohit, Kuldeep
Rajagopalan, A. N.
[J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 162 - 173

← 1 2 3 4 5 6 7 8 →