Two-stage framework with improved U-Net based on self-supervised contrastive learning for pavement crack segmentation

Cited by: 5
Authors
Song, Qingsong [1]
Yao, Wei [1]
Tian, Haojiang [1]
Guo, Yidan [1]
Muniyandi, Ravie Chandren [2]
An, Yisheng [1]
Affiliations
[1] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Selangor, Malaysia
Keywords
Pavement crack segmentation; Self-supervised contrastive learning; Pre-training; U-Net; Attention;
DOI
10.1016/j.eswa.2023.122406
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Since the emergence of deep learning, automated detection of pavement cracks from images has progressed significantly. The dominant approach is supervised deep learning, which relies on large-scale labeled ground truth. In practice, however, most raw crack images are unlabeled and therefore cannot be fully exploited by supervised network models. Contrastive learning, a representative self-supervised method, can learn feature representations from unlabeled data and thereby improve the accuracy of downstream tasks. This paper proposes a two-stage framework, built on an improved U-Net and self-supervised contrastive learning, for pavement crack image segmentation. The improved U-Net integrates a residual structure and an attention mechanism into the typical U-Net architecture so as to highlight the salient features of fine cracks, the segmentation target. The framework comprises two learning stages: pre-training and fine-tuning. In the pre-training stage, latent feature representations are learned from unlabeled crack images. Both crack images and pavement background images are included in the training data, so that the model learns, by contrast and without supervision, a mapping that separates cracks from their background in a high-dimensional vector space. In the fine-tuning stage, the network loads the pre-trained parameters and is retrained on the labeled training data. Experimental results show that the proposed two-stage framework significantly improves crack segmentation accuracy without requiring additional training samples or labels.
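The pre-training idea described in the abstract, pulling embeddings of crack images together while pushing them away from pavement background embeddings, can be illustrated with a small sketch. The abstract does not state which contrastive objective the paper uses, so the InfoNCE-style loss below is an assumption, and the function names, the temperature value, and the 3-D toy embeddings are all hypothetical. It is not the authors' implementation.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_loss(anchor, positive, negatives, temperature=0.1):
    """InfoNCE-style loss for one anchor embedding (assumed objective).

    The loss is small when the anchor is close to its positive and far
    from every negative, and large otherwise.
    """
    pos_logit = cosine_similarity(anchor, positive) / temperature
    logits = [pos_logit] + [
        cosine_similarity(anchor, n) / temperature for n in negatives
    ]
    # Numerically stable log-sum-exp over all logits.
    max_logit = max(logits)
    log_sum_exp = max_logit + math.log(
        sum(math.exp(l - max_logit) for l in logits)
    )
    return log_sum_exp - pos_logit


# Hypothetical embeddings: two views of a crack patch and two background patches.
crack_view_a = [0.9, 0.1, 0.0]
crack_view_b = [0.8, 0.2, 0.1]
background = [[0.0, 1.0, 0.2], [0.1, 0.0, 1.0]]

# Crack views treated as a positive pair, background patches as negatives.
well_separated = info_nce_loss(crack_view_a, crack_view_b, background)

# Deliberately wrong pairing: a background patch is used as the positive.
poorly_separated = info_nce_loss(
    crack_view_a, background[0], [crack_view_b, background[1]]
)
```

In this toy setup the correctly paired case yields a much lower loss than the mispaired one, which is the signal an encoder would be trained to minimize before being fine-tuned on labeled masks.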
Pages: 13