Multi-scale triple-attention network for pixelwise crack segmentation

Cited by: 38
Authors
Yang, Lei [1 ,2 ]
Bai, Suli [1 ,2 ]
Liu, Yanhong [1 ,2 ]
Yu, Hongnian [2 ,3 ]
Affiliations
[1] Zhengzhou Univ, Sch Elect & Informat Engn, Henan 450001, Peoples R China
[2] Robot Percept & Control Engn Lab Henan Prov, Zhengzhou 450001, Henan, Peoples R China
[3] Edinburgh Napier Univ, Built Environm, Edinburgh EH10 5DT, Scotland
Keywords
Pavement crack segmentation; Semantic segmentation; Residual network; Multiscale input strategy; Deep supervision mechanism;
DOI
10.1016/j.autcon.2023.104853
Chinese Library Classification (CLC) number
TU [Building Science];
Discipline classification code
0813 ;
Abstract
Currently, intelligent crack detection is of great value for the maintenance of infrastructure, of which the most significant kind in China is roads. With accurate defect detection, pavement can be repaired and maintained in a timely manner, which significantly reduces the occurrence of hazards. However, the detection of pavement defects remains a great challenge owing to many difficulties, for example, complex backgrounds, microdefects, various defect shapes and sizes, and class imbalance. Recently, deep learning has demonstrated superior performance on pixelwise image segmentation, but some issues remain for demanding pixelwise segmentation tasks, for instance, a limited receptive field, insufficient processing of local features, and information loss caused by pooling operations. To address these issues, a multiscale triple-attention network, named MST-Net, is proposed for end-to-end pixelwise crack detection. First, a multiscale input strategy is applied to the proposed segmentation network to capture more context information; it also reduces the information loss caused by pooling operations. Second, to realize effective representation of local features, an additive attention fusion (AAF) block is proposed to guide feature learning to capture both global and local contexts. In addition, to handle the class imbalance in crack detection, a triple attention (TA) block is proposed to capture spatial, channel and pixel attention information and suppress background and useless information, which is conducive to the characterization of microcracks. Finally, to address the limited receptive field, a multiscale feature aggregation unit is proposed for feature fusion to improve the detection of multiscale defects.
To better guide network training, a deep supervision mechanism is also introduced to speed up the convergence of the proposed segmentation model and improve the performance of defect segmentation. The related evaluation and detection experiments are carried out on three public datasets on crack segmentation, and the comparison experiments with the mainstream segmentation models show that the proposed segmentation network achieves excellent performance on pixelwise crack detection.
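The abstract names a multiscale input strategy and a deep supervision mechanism but gives no implementation details. The following is only a minimal sketch of those two general ideas, not the MST-Net implementation. Everything specific in it is an assumption made for illustration: 2x2 average pooling as the downsampling step, three pyramid levels, binary cross-entropy as the per-scale loss, and the hypothetical helper names `avg_pool2x2`, `build_pyramid`, `bce` and `deep_supervision_loss`.

```python
import math


def avg_pool2x2(img):
    """Downsample a 2-D list by averaging non-overlapping 2x2 blocks (assumed choice)."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [
        [
            (img[2 * i][2 * j] + img[2 * i][2 * j + 1]
             + img[2 * i + 1][2 * j] + img[2 * i + 1][2 * j + 1]) / 4.0
            for j in range(w)
        ]
        for i in range(h)
    ]


def build_pyramid(img, levels=3):
    """Multiscale input: the full-resolution image plus successively halved copies."""
    pyramid = [img]
    for _ in range(levels - 1):
        pyramid.append(avg_pool2x2(pyramid[-1]))
    return pyramid


def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy between two same-sized 2-D lists."""
    total, count = 0.0, 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
            total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
            count += 1
    return total / count


def deep_supervision_loss(side_preds, side_targets, weights):
    """Weighted sum of per-scale losses, one term per supervised side output."""
    return sum(w * bce(p, t) for w, p, t in zip(weights, side_preds, side_targets))


# Toy 4x4 "crack mask": the top-right 2x2 block is crack, the rest is background.
mask = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
pyramid = build_pyramid(mask, levels=3)  # 4x4, 2x2 and 1x1 versions
```

In a real network, each pyramid level would be fed to the encoder stage of matching resolution, and the side outputs of several decoder stages would each be compared against a ground-truth mask resized to the same scale; the weights passed to `deep_supervision_loss` would be tuning parameters.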
Pages: 12
Related papers
50 in total
  • [31] Multi-Scale Kolmogorov-Arnold Network (KAN)-Based Linear Attention Network: Multi-Scale Feature Fusion with KAN and Deformable Convolution for Urban Scene Image Semantic Segmentation
    Li, Yuanhang
    Liu, Shuo
    Wu, Jie
    Sun, Weichao
    Wen, Qingke
    Wu, Yibiao
    Qin, Xiujuan
    Qiao, Yanyou
    REMOTE SENSING, 2025, 17 (05)
  • [32] A Road Crack Segmentation Method Based on Transformer and Multi-Scale Feature Fusion
    Xu, Yang
    Xia, Yonghua
    Zhao, Quai
    Yang, Kaihua
    Li, Qiang
    ELECTRONICS, 2024, 13 (12)
  • [33] Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion Utilizing a Multi-Scale Dilated Convolutional Pyramid
    Zhao, Shan
    Wang, Zihao
    Huo, Zhanqiang
    Zhang, Fukai
    SENSORS, 2024, 24 (16)
  • [34] MSCSA-Net: Multi-Scale Channel Spatial Attention Network for Semantic Segmentation of Remote Sensing Images
    Liu, Kuan-Hsien
    Lin, Bo-Yen
    APPLIED SCIENCES-BASEL, 2023, 13 (17)
  • [35] MRASFusion: A multi-scale residual attention infrared and visible image fusion network based on semantic segmentation guidance
    An, Rongsheng
    Liu, Gang
    Qian, Yao
    Xing, Mengliang
    Tang, Haojie
    INFRARED PHYSICS & TECHNOLOGY, 2024, 139
  • [36] Multi-scale Wavelet Frequency Channel Attention for Remote Sensing Image Segmentation
    Su, Yu-Chen
    Liu, Tsung-Jung
    Liu, Kuan-Hsien
    2022 IEEE 14TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2022
  • [37] Multi-Scale Attention Convolutional Network for Masson Stained Bile Duct Segmentation from Liver Pathology Images
    Su, Chun-Han
    Chung, Pau-Choo
    Lin, Sheng-Fung
    Tsai, Hung-Wen
    Yang, Tsung-Lung
    Su, Yu-Chieh
    SENSORS, 2022, 22 (07)
  • [38] Deep Attention and Multi-Scale Networks for Accurate Remote Sensing Image Segmentation
    Qi, Xingqun
    Li, Kaiqi
    Liu, Pengkun
    Zhou, Xiaoguang
    Sun, Muyi
    IEEE ACCESS, 2020, 8 (08): 146627-146639
  • [39] Multi-scale Generative Adversarial Network for Automatic Sublingual Vein Segmentation
    Xiong, Qingyue
    Li, Xinlei
    Yang, Dawei
    Zhang, Wei
    Zhang, Ye
    Kong, Yajie
    Li, Fufeng
    Zhang, Wenqiang
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020: 851-856
  • [40] Multi-Scale Feature Aggregation Network for Semantic Segmentation of Land Cover
    Shen, Xu
    Weng, Liguo
    Xia, Min
    Lin, Haifeng
    REMOTE SENSING, 2022, 14 (23)