Label-Related-Guided Multimodality Long-Tailed Sewer Symbiotic Defect Recognition

被引:0
作者
Zhong, Yuzhong [1 ]
Zou, Yafeng [1 ]
Cheng, Jin [2 ]
Zhang, Linghu [2 ]
Yang, Dan [2 ]
Dian, Songyi [1 ]
机构
[1] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Peoples R China
[2] Chengdu Xingrong Municipal Facil Management Co Ltd, Chengdu 610065, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptation models; Heavily-tailed distribution; Tail; Correlation; Image recognition; Visualization; Semantics; Training; Face recognition; Data models; Discrete image-text interaction; knowledge distillation (KD); long-tail learning; multilabel learning; sewer defect recognition; CLASSIFICATION;
D O I
10.1109/TIM.2025.3568955
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automated sewer defect recognition technology based on machine vision is crucial for modern urban sewage systems. However, existing recognition models face two significant challenges: 1) inadequate performance in identifying rare but high-risk defects and 2) complex interrelations among co-occurring defects that hinder the extraction of discriminative features. To tackle these issues, we propose a label-guided multimodal sewer defect recognition method incorporating a tail-aware knowledge distillation (KD) strategy. This strategy involves fine-tuning the teacher model on tail data to guide the student model's learning process, enhancing its ability to identify rare defect features. Furthermore, our proposed discrete image-text interaction module (DITIM) explores the semantic relationships between image patches and text through an interactive mechanism, which helps uncover co-occurrence relationships within multilabel information. This improves the model's capability to capture complex correlations between different defects. The experimental validation on the Sewer-ML and QV-Pipe datasets demonstrates that our approach not only boosts overall recognition accuracy but also excels in detecting rare, high-risk defects, offering an effective technical solution for sewer defect identification and management.
引用
收藏
页数:11
相关论文
共 47 条
[41]   Automatic Detection and Classification of Sewer Defects via Hierarchical Deep Learning [J].
Xie, Qian ;
Li, Dawei ;
Xu, Jinxuan ;
Yu, Zhenghao ;
Wang, Jun .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (04) :1836-1847
[42]  
Yan JX, 2024, Arxiv, DOI arXiv:2408.08125
[43]  
Zhang X., 2021, P INT C COMP VIS OCT, P7960
[44]   Deep Mutual Learning [J].
Zhang, Ying ;
Xiang, Tao ;
Hospedales, Timothy M. ;
Lu, Huchuan .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4320-4328
[45]  
Zhao C., 2023, P IEEE INT C AC SPEE, P1
[46]   Trusted Video-Based Sewer Inspection via Support Clip-Based Pareto-Optimal Evidential Network [J].
Zhao, Chenyang ;
Hu, Chuanfei ;
Shao, Hang ;
Dunkin, Fir ;
Wang, Yongxiong .
IEEE SIGNAL PROCESSING LETTERS, 2025, 32 :356-360
[47]   Conditional Prompt Learning for Vision-Language Models [J].
Zhou, Kaiyang ;
Yang, Jingkang ;
Loy, Chen Change ;
Liu, Ziwei .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :16795-16804