Unsupervised domain adaptation-based crack segmentation using transformer network

Cited by: 11

Authors:

Beyene, Daniel Asefa [1]

Tran, Dai Quoc [2]

Maru, Michael Bekele [3]

Kim, Taeheon [4]

Park, Solmoi [5]

Park, Seunghee [3]

Affiliations:

[1] Sungkyunkwan Univ, Dept Global Smart City, Suwon 16419, South Korea

[2] Sungkyunkwan Univ, Global Engn Inst Ultimate Soc, Suwon 16419, South Korea

[3] Sungkyunkwan Univ, Sch Civil Architectural & Environm Syst Engn, Suwon 16419, South Korea

[4] Sungkyunkwan Univ, DNBio Pharm Inc, Res Ctr, Suwon 16419, South Korea

[5] Pukyong Natl Univ, Dept Civil Engn, Pusan 48513, South Korea

Source:

JOURNAL OF BUILDING ENGINEERING | 2023 / Vol. 80

Funding:

National Research Foundation of Singapore;

Keywords:

Crack detection; Unsupervised domain adaptation; Masked image consistency; Transformer network; Convolutional neural networks;

DOI:

10.1016/j.jobe.2023.107889

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813;

Abstract:

Surface cracks are a common structural defect. Intelligent inspection of these defects through computer vision and deep learning is important for early maintenance and operation. Despite the success of supervised learning methods in detecting surface cracks, their performance relies heavily on the availability of extensive labeled datasets. Annotating a single image can be time-consuming and prone to human error. Moreover, these methods often struggle to generalize to unseen datasets because of disparities between source and target images. Unsupervised domain adaptation addresses this issue by transferring knowledge learned from the labeled source domain to the unlabeled target domain. We therefore evaluated a recent unsupervised domain adaptation model for semantic segmentation that incorporates masked image consistency into DAFormer, a state-of-the-art model able to adapt to various datasets. To assess the model's performance, we used three publicly available crack datasets, each containing background and crack classes. Our study revealed that: (1) SegFormer, a transformer-based model, outperforms ConvNet-based models without using adaptation knowledge, demonstrating superior generalizability to previously unseen data. (2) The unsupervised domain adaptation model consistently outperforms the source model, improving the mean intersection over union of SegFormer's source-only approach by 10% to 22%. With the exception of a single case, the relative performance of unsupervised domain adaptation compared to supervised training with labeled data exceeds 85%, underscoring its promise for crack segmentation. The adopted method is therefore a viable alternative, particularly where labeled data are scarce or prohibitively expensive.
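The abstract reports results as mean intersection over union (mIoU) over the two classes (background and crack) and as the performance of unsupervised domain adaptation relative to supervised training. The following is a minimal sketch of how such numbers are typically computed, not the authors' evaluation code. The function names, the toy masks, and the reading of "relative performance" as the ratio of the adapted model's mIoU to the supervised model's mIoU are all assumptions made for illustration.

```python
def mean_iou(pred, target, num_classes=2):
    """Mean IoU over classes for two same-sized, non-empty label masks.

    Masks are lists of rows of integer labels (assumed: 0 = background, 1 = crack).
    """
    ious = []
    for c in range(num_classes):
        inter = 0
        union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                in_pred, in_target = p == c, t == c
                inter += in_pred and in_target  # pixel is class c in both masks
                union += in_pred or in_target   # pixel is class c in either mask
        if union > 0:  # skip a class that is absent from both masks
            ious.append(inter / union)
    return sum(ious) / len(ious)


def relative_performance(uda_miou, supervised_miou):
    """Adapted-model mIoU as a percentage of the supervised mIoU (assumed definition)."""
    return 100.0 * uda_miou / supervised_miou


# Toy 4x4 masks invented for illustration; these are not data from the paper.
target = [
    [0, 0, 1, 0],
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
pred = [
    [0, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]

print(f"mIoU: {mean_iou(pred, target):.4f}")
print(f"relative performance: {relative_performance(0.6, 0.8):.1f}%")
```

Averaging the per-class IoU rather than pooling all pixels keeps the thin crack class from being swamped by the much larger background class, which is why mIoU is a common choice for crack segmentation.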


Pages: 18
