Two-stage framework with improved U-Net based on self-supervised contrastive learning for pavement crack segmentation

被引：5

作者：

Song, Qingsong ^{[1
]}

Yao, Wei ^{[1
]}

Tian, Haojiang ^{[1
]}

Guo, Yidan ^{[1
]}

Muniyandi, Ravie Chandren ^{[2
]}

An, Yisheng ^{[1
]}

机构：

[1] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China

[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Selangor, Malaysia

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 238卷

关键词：

Pavement crack segmentation; Self-supervised contrastive learning; Pre-training; U-Net; Attention;

D O I：

10.1016/j.eswa.2023.122406

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

After the deep learning method emerged, the automated detection technology of pavement crack images has significantly progressed. The dominant approach is supervised deep learning, which relies on large-scale labeled ground truth. However, the problems are mostly unlabeled original crack images, which are difficult to fully utilize by the supervised deep learning network model. As a representative method of self-supervised learning, contrast learning can learn feature representations from unlabeled data, thus improving the accuracy of downstream tasks. This paper proposes a two-stage framework with improved U-Net based on self-supervised contrastive learning for pavement crack image segmentation. The framework takes improved U-Net as the basic architecture to highlight the significant features of the target segment of fine cracks. U-Net is improved by integrating the residual structure and attention mechanism in the typical U-Net architecture. The framework includes two learning stages: pre-training and fine-tuning. In the pre-training stage, the potential feature representation is learned from the unlabeled crack image. Crack images and pavement background images are used in the training data so that the model learns the distinguishable mapping relationship between crack and its background in the high-dimensional vector space without supervision comparison. In the fine-tuning stage, the network loads the parameters after the pre-training and uses the labeled training data for the retraining. Experimental results show that the proposed two-stage framework significantly improves the performance of crack segmentation accuracy without increasing the number of existing training samples and their labeling.

引用

页数：13

共 48 条

[1] Cosine similarity measures of bipolar neutrosophic set for diagnosis of bipolar disorder diseases
Abdel-Basset, Mohamed
Mohamed, Mai
Elhoseny, Mohamed
Le Hoang Son
Chiclana, Francisco
Zaied, Abd El-Nasser H.
[J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 101
[2] Chen T, 2020, PR MACH LEARN RES, V119
[3] Self-Supervised GANs via Auxiliary Rotation Loss
Chen, Ting
Zhai, Xiaohua
Ritter, Marvin
Lucic, Mario
Houlsby, Neil
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12146 - 12155
[4] Chen XL, 2020, Arxiv, DOI arXiv:2003.04297
[5] Thermographic fault diagnosis of electrical faults of commutator and induction motors
Glowacz, Adam
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
[6] Thermographic Fault Diagnosis of Shaft of BLDC Motor
Glowacz, Adam
[J]. SENSORS, 2022, 22 (21)
[7] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[8] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[9] Intra- and Inter-Slice Contrastive Learning for Point Supervised OCT Fluid Segmentation
He, Xingxin
Fang, Leyuan
Tan, Mingkui
Chen, Xiangdong
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1870 - 1881
[10] Henaff O., 2020, 37 INT C MACH LEARN, P4130

← 1 2 3 4 5 →