Unsupervised domain adaptation-based crack segmentation using transformer network

被引:11
作者
Beyene, Daniel Asefa [1 ]
Tran, Dai Quoc [2 ]
Maru, Michael Bekele [3 ]
Kim, Taeheon [4 ]
Park, Solmoi [5 ]
Park, Seunghee [3 ]
机构
[1] Sungkyunkwan Univ, Dept Global Smart City, Suwon 16419, South Korea
[2] Sungkyunkwan Univ, Global Engn Inst Ultimate Soc, Suwon 16419, South Korea
[3] Sungkyunkwan Univ, Sch Civil Architectural & Environm Syst Engn, Suwon 16419, South Korea
[4] Sungkyunkwan Univ, DNBio Pharm Inc, Res Ctr, Suwon 16419, South Korea
[5] Pukyong Natl Univ, Dept Civil Engn, Pusan 48513, South Korea
来源
JOURNAL OF BUILDING ENGINEERING | 2023年 / 80卷
基金
新加坡国家研究基金会;
关键词
Crack detection; Unsupervised domain adaptation; Masked image consistency; Transformer network; Convolutional neural networks;
D O I
10.1016/j.jobe.2023.107889
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Surface cracks are a common structural defect. The intelligent inspection of these defects through computer vision and deep learning is of paramount importance for early maintenance and operation. Despite the remarkable success of supervised learning methods in detecting surface cracks, their performance heavily relies on the availability of extensive labeled datasets. Annotating a single image can be a time-consuming process, prone to human error. Moreover, these methods often struggle to generalize effectively to unseen datasets due to disparities between source and target images. To address this issue, unsupervised domain adaptation comes into play, as it aims to transfer knowledge learned from the labeled source domain to the unlabeled target domain. Consequently, we conducted an evaluation of a recent unsupervised domain adaptation model for semantic segmentation that incorporates masked image consistency into DAFormer, a state-of-the-art model with the ability to adapt to various datasets. To assess the model's performance, we employed three publicly available crack datasets, each containing background and crack classes. Our study has revealed that : (1) SegFormer, a transformer-based model, outperforms ConvNet-based models without utilizing adaptation knowledge, demonstrating superior generalizability to previously unseen data. (2) The unsupervised domain-adaptation model consistently outperforms the source model, resulting in a significant enhancement in the mean intersection over union of SegFormer's source-only approach by a remarkable 10% to 22%. With the exception of a single case, the relative performance of unsupervised domain adaptation compared to supervised training with labeled data exceeds 85%, underscoring its promising performance in crack segmentation. Consequently, our adopted method emerges as a viable alternative, particularly in scenarios where labeled data is scarce or prohibitively expensive.
引用
收藏
页数:18
相关论文
共 71 条
  • [31] Kim Su-Min, 2020, [Journal of The Korea Society of Computer and Information, 한국컴퓨터정보학회논문지], V25, P35, DOI 10.9708/jksci.2020.25.10.035
  • [32] König J, 2019, IEEE IMAGE PROC, P1460, DOI [10.1109/ICIP.2019.8803060, 10.1109/icip.2019.8803060]
  • [33] Konig J, 2022, Arxiv, DOI arXiv:2202.03714
  • [34] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [35] Unsupervised Deep Learning for Road Crack Classification by Fusing Convolutional Neural Network and K_Means Clustering
    Li, Wei
    Huyan, Ju
    Gao, Rong
    Hao, Xueli
    Hu, Yuanjiao
    Zhang, Yingjie
    [J]. JOURNAL OF TRANSPORTATION ENGINEERING PART B-PAVEMENTS, 2021, 147 (04)
  • [36] Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
    Li, Weitao
    Gao, Hui
    Su, Yi
    Momanyi, Biffon Manyura
    [J]. REMOTE SENSING, 2022, 14 (19)
  • [37] Lijuan Duan, 2020, ICMSSP 2020: Proceedings of the 2020 5th International Conference on Multimedia Systems and Signal Processing, P6, DOI 10.1145/3404716.3404720
  • [38] Liu H., 2021, IEEE Trans. Intell. Transp. Syst., V24, P1669
  • [39] SSD: Single Shot MultiBox Detector
    Liu, Wei
    Anguelov, Dragomir
    Erhan, Dumitru
    Szegedy, Christian
    Reed, Scott
    Fu, Cheng-Yang
    Berg, Alexander C.
    [J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37
  • [40] DeepCrack: A deep hierarchical feature learning architecture for crack segmentation
    Liu, Yahui
    Yao, Jian
    Lu, Xiaohu
    Xie, Renping
    Li, Li
    [J]. NEUROCOMPUTING, 2019, 338 : 139 - 153