PCTC-Net: A Crack Segmentation Network with Parallel Dual Encoder Network Fusing Pre-Conv-Based Transformers and Convolutional Neural Networks

被引:3
作者
Moon, Ji-Hwan [1 ]
Choi, Gyuho [1 ]
Kim, Yu-Hwan [2 ]
Kim, Won-Yeol [1 ]
机构
[1] Chosun Univ, Dept Artificial Intelligence Engn, Gwangju 61452, South Korea
[2] Chosun Univ, Dept Comp Engn, Gwangju 61452, South Korea
基金
新加坡国家研究基金会;
关键词
crack; segmentation; CNN; transformer; PCTC-Net; Pre-Conv; DAMAGE DETECTION; DEEP;
D O I
10.3390/s24051467
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Cracks are common defects that occur on the surfaces of objects and structures. Crack detection is a critical maintenance task that traditionally requires manual labor. Large-scale manual inspections are expensive. Research has been conducted to replace expensive human labor with cheaper computing resources. Recently, crack segmentation based on convolutional neural networks (CNNs) and transformers has been actively investigated for local and global information. However, the transformer is data-intensive owing to its weak inductive bias. Existing labeled datasets for crack segmentation are relatively small. Additionally, a limited amount of fine-grained crack data is available. To address this data-intensive problem, we propose a parallel dual encoder network fusing Pre-Conv-based Transformers and convolutional neural networks (PCTC-Net). The Pre-Conv module automatically optimizes each color channel with a small spatial kernel before the input of the transformer. The proposed model, PCTC-Net, was tested with the DeepCrack, Crack500, and Crackseg9k datasets. The experimental results showed that our model achieved higher generalization performance, stability, and F1 scores than the SOTA model DTrC-Net.
引用
收藏
页数:16
相关论文
共 45 条
[1]  
Alfarrarjeh A, 2018, IEEE INT CONF BIG DA, P5201, DOI 10.1109/BigData.2018.8621899
[2]  
Ali R., 2022, P 2022 IEEE S IND EL, P1, DOI [10.1109/ISIEA54517.2022.9873770, DOI 10.1109/ISIEA54517.2022.9873770]
[3]   Evaluation of deep learning approaches based on convolutional neural networks for corrosion detection [J].
Atha, Deegan J. ;
Jahanshahi, Mohammad R. .
STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2018, 17 (05) :1110-1128
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]   Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks [J].
Cha, Young-Jin ;
Choi, Wooram ;
Buyukozturk, Oral .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2017, 32 (05) :361-378
[6]  
Chen Jieneng, 2021, arXiv, DOI DOI 10.48550/ARXIV.2102.04306
[7]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, 10.48550/arXiv.2010.11929, DOI 10.48550/ARXIV.2010.11929]
[8]   Distribution equalization learning mechanism for road crack detection [J].
Fang, Jie ;
Qu, Bo ;
Yuan, Yuan .
NEUROCOMPUTING, 2021, 424 :193-204
[9]   A Survey on Vision Transformer [J].
Han, Kai ;
Wang, Yunhe ;
Chen, Hanting ;
Chen, Xinghao ;
Guo, Jianyuan ;
Liu, Zhenhua ;
Tang, Yehui ;
Xiao, An ;
Xu, Chunjing ;
Xu, Yixing ;
Yang, Zhaohui ;
Zhang, Yiman ;
Tao, Dacheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :87-110
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778