Multi-scale triple-attention network for pixelwise crack segmentation

被引:38
作者
Yang, Lei [1 ,2 ]
Bai, Suli [1 ,2 ]
Liu, Yanhong [1 ,2 ]
Yu, Hongnian [2 ,3 ]
机构
[1] Zhengzhou Univ, Sch Elect & Informat Engn, Henan 450001, Peoples R China
[2] Robot Percept & Control Engn Lab Henan Prov, Zhengzhou 450001, Henan, Peoples R China
[3] Edinburgh Napier Univ, Built Environm, Edinburgh EH10 5DT, Scotland
关键词
Pavement crack segmentation; Semantic segmentation; Residual network; Multiscale input strategy; Deep supervision mechanism;
D O I
10.1016/j.autcon.2023.104853
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Currently, intelligent crack detection is of great value for the maintenance of infrastructure, of which the most significant kind in China is roads. For pavement defects, the pavement can be repaired and maintained in a timely manner with an accurate defect detection task, which significantly reduces the occurrence of hazards. However, the detection of pavement defects remains a great challenge owing to many difficulties, for example, complex backgrounds, microdefects, various defect shapes and sizes, class imbalance issues, etc. Recently, deep learning has demonstrated its superior performance on pixelwise image segmentation, but some issues still exist on demanding pixelwise image segmentation, for instance, limited receptive field, insufficiency processing of local features, information loss issue generated by pooling operations, etc. Based on all of the above issues, a multiscale triple-attention network, named MST-Net, is proposed for end-to-end pixelwise crack detection. First, a multiscale input strategy is applied to the proposed segmentation network to capture more context information. Meanwhile, it can capably reduce the effect of the information loss issue generated by pooling operations. Second, to realize effective feature representation of local features, an additive attention fusion (AAF) block is proposed to guide feature learning to capture both global and local contexts. In addition, faced with the crack detection task with class imbalance issues, a triple attention (TA) block is proposed to detect spatial, channel and pixel attention information to suppress the background and useless information, which is conducive to the characterization of microcracks. Finally, aiming at the limited receptive field, a multiscale feature aggregation unit is proposed for feature fusion to increase the detection ability of multiscale defects. To better guide network training, a deep supervision mechanism is also introduced to speed up the convergence of the proposed segmentation model and improve the performance of defect segmentation. The related evaluation and detection experiments are carried out on three public datasets on crack segmentation, and the comparison experiments with the mainstream segmentation models show that the proposed segmentation network achieves excellent performance on pixelwise crack detection.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Multi-Scale Convolutional Features Network for Semantic Segmentation in Indoor Scenes
    Wang, Yanran
    Chen, Qingliang
    Chen, Shilang
    Wu, Junjun
    IEEE ACCESS, 2020, 8 : 89575 - 89583
  • [42] Desert classification based on a multi-scale residual network with an attention mechanism
    Liguo Weng
    Lexuan Wang
    Min Xia
    Huixiang Shen
    Jia Liu
    Yiqing Xu
    Geosciences Journal, 2021, 25 : 387 - 399
  • [43] Desert classification based on a multi-scale residual network with an attention mechanism
    Weng, Liguo
    Wang, Lexuan
    Xia, Min
    Shen, Huixiang
    Liu, Jia
    Xu, Yiqing
    GEOSCIENCES JOURNAL, 2021, 25 (03) : 387 - 399
  • [44] Segmentation of crack disaster images based on feature extraction enhancement and multi-scale fusion
    Wang, Letian
    Wu, Gengkun
    Tossou, Akpedje Ingrid Hermilda C. F.
    Liang, Zengwei
    Xu, Jie
    EARTH SCIENCE INFORMATICS, 2025, 18 (01)
  • [45] Semantic segmentation of autonomous driving scenes based on multi-scale adaptive attention mechanism
    Liu, Danping
    Zhang, Dong
    Wang, Lei
    Wang, Jun
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [46] GourmetNet: Food Segmentation Using Multi-Scale Waterfall Features with Spatial and Channel Attention
    Sharma, Udit
    Artacho, Bruno
    Savakis, Andreas
    SENSORS, 2021, 21 (22)
  • [47] An agricultural leaf disease segmentation model applying multi-scale coordinate attention mechanism
    Gu, Renyuan
    Liu, Liqun
    APPLIED SOFT COMPUTING, 2025, 172
  • [48] Deeply supervised breast cancer segmentation combined with multi-scale and attention-residuals
    Qin C.-B.
    Song Z.-Y.
    Zeng J.-Y.
    Tian L.-F.
    Li F.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2021, 29 (04): : 877 - 895
  • [49] Ground Crack Recognition Based on Fully Convolutional Network With Multi-Scale Input
    Cheng, Jian
    Ye, Liang
    Guo, Yinan
    Zhang, Jun
    An, Hongbo
    IEEE ACCESS, 2020, 8 : 53034 - 53048
  • [50] MFFLNet: lightweight semantic segmentation network based on multi-scale feature fusion
    Wei Depeng
    Wang Huabin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (10) : 30073 - 30093