Label-Related-Guided Multimodality Long-Tailed Sewer Symbiotic Defect Recognition

被引:0
作者
Zhong, Yuzhong [1 ]
Zou, Yafeng [1 ]
Cheng, Jin [2 ]
Zhang, Linghu [2 ]
Yang, Dan [2 ]
Dian, Songyi [1 ]
机构
[1] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Peoples R China
[2] Chengdu Xingrong Municipal Facil Management Co Ltd, Chengdu 610065, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptation models; Heavily-tailed distribution; Tail; Correlation; Image recognition; Visualization; Semantics; Training; Face recognition; Data models; Discrete image-text interaction; knowledge distillation (KD); long-tail learning; multilabel learning; sewer defect recognition; CLASSIFICATION;
D O I
10.1109/TIM.2025.3568955
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automated sewer defect recognition technology based on machine vision is crucial for modern urban sewage systems. However, existing recognition models face two significant challenges: 1) inadequate performance in identifying rare but high-risk defects and 2) complex interrelations among co-occurring defects that hinder the extraction of discriminative features. To tackle these issues, we propose a label-guided multimodal sewer defect recognition method incorporating a tail-aware knowledge distillation (KD) strategy. This strategy involves fine-tuning the teacher model on tail data to guide the student model's learning process, enhancing its ability to identify rare defect features. Furthermore, our proposed discrete image-text interaction module (DITIM) explores the semantic relationships between image patches and text through an interactive mechanism, which helps uncover co-occurrence relationships within multilabel information. This improves the model's capability to capture complex correlations between different defects. The experimental validation on the Sewer-ML and QV-Pipe datasets demonstrates that our approach not only boosts overall recognition accuracy but also excels in detecting rare, high-risk defects, offering an effective technical solution for sewer defect identification and management.
引用
收藏
页数:11
相关论文
共 47 条
[1]   Sustainable urban infrastructure: A review [J].
Carvalho Ferrer, Ana Luiza ;
Tavares Thome, Antonio Marcio ;
Scavarda, Annibal Jose .
RESOURCES CONSERVATION AND RECYCLING, 2018, 128 :360-372
[2]   Multi-Label Meta Weighting for Long-Tailed Dynamic Scene Graph Generation [J].
Chen, Shuo ;
Du, Yingjun ;
Mettes, Pascal ;
Snoek, Cees G. M. .
PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, :39-47
[3]   Multi-Label Image Recognition with Graph Convolutional Networks [J].
Chen, Zhao-Min ;
Wei, Xiu-Shen ;
Wang, Peng ;
Guo, Yanwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5172-5181
[4]   DefectTR: End-to-end defect detection for sewage networks using a transformer [J].
Dang, L. Minh ;
Wang, Hanxiang ;
Li, Yanfen ;
Nguyen, Tan N. ;
Moon, Hyeonjoon .
CONSTRUCTION AND BUILDING MATERIALS, 2022, 325
[5]  
Dong BW, 2022, Arxiv, DOI arXiv:2210.01033
[6]   SlowFast Networks for Video Recognition [J].
Feichtenhofer, Christoph ;
Fan, Haoqi ;
Malik, Jitendra ;
He, Kaiming .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6201-6210
[7]   Exploring Classification Equilibrium in Long-Tailed Object Detection [J].
Feng, Chengjian ;
Zhong, Yujie ;
Huang, Weilin .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3397-3406
[8]   Texts as Images in Prompt Tuning for Multi-Label Image Recognition [J].
Guo, Zixian ;
Dong, Bowen ;
Ji, Zhilong ;
Bai, Jinfeng ;
Guo, Yiwen ;
Zuo, Wangmeng .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :2808-2817
[9]   Underground sewer pipe condition assessment based on convolutional neural networks [J].
Hassan, Syed Ibrahim ;
Dang, L. Minh ;
Mehmood, Irfan ;
Im, Suhyeon ;
Choi, Changho ;
Kang, Jaemo ;
Park, Young-Soo ;
Moon, Hyeonjoon .
AUTOMATION IN CONSTRUCTION, 2019, 106
[10]   Multi-scale hybrid vision transformer and Sinkhorn tokenizer for sewer defect classification [J].
Haurum, Joakim Bruslund ;
Madadi, Meysam ;
Escalera, Sergio ;
Moeslund, Thomas B. .
AUTOMATION IN CONSTRUCTION, 2022, 144