Multi-scale triple-attention network for pixelwise crack segmentation

Cited by: 38

Authors:

Yang, Lei [1,2]

Bai, Suli [1,2]

Liu, Yanhong [1,2]

Yu, Hongnian [2,3]

Affiliations:

[1] Zhengzhou Univ, Sch Elect & Informat Engn, Henan 450001, Peoples R China

[2] Robot Percept & Control Engn Lab Henan Prov, Zhengzhou 450001, Henan, Peoples R China

[3] Edinburgh Napier Univ, Built Environm, Edinburgh EH10 5DT, Scotland

Source:

AUTOMATION IN CONSTRUCTION | 2023 / Vol. 150

Keywords:

Pavement crack segmentation; Semantic segmentation; Residual network; Multiscale input strategy; Deep supervision mechanism;

DOI:

10.1016/j.autcon.2023.104853

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813 ;

Abstract:

Intelligent crack detection is of great value for infrastructure maintenance, and in China the most significant infrastructure of this kind is roads. Accurate defect detection allows pavement to be repaired and maintained in a timely manner, which significantly reduces the occurrence of hazards. However, detecting pavement defects remains a great challenge owing to complex backgrounds, microdefects, varied defect shapes and sizes, and class imbalance. Deep learning has recently demonstrated superior performance on pixelwise image segmentation, but demanding segmentation tasks still suffer from a limited receptive field, insufficient processing of local features, and information loss caused by pooling operations. To address these issues, a multiscale triple-attention network, named MST-Net, is proposed for end-to-end pixelwise crack detection. First, a multiscale input strategy is applied to the segmentation network to capture more context information and to reduce the information loss caused by pooling operations. Second, to represent local features effectively, an additive attention fusion (AAF) block is proposed to guide feature learning to capture both global and local contexts. In addition, to handle the class imbalance in crack detection, a triple attention (TA) block is proposed to capture spatial, channel and pixel attention information and suppress background and useless information, which helps characterize microcracks. Finally, to address the limited receptive field, a multiscale feature aggregation unit is proposed for feature fusion to improve the detection of multiscale defects. To better guide network training, a deep supervision mechanism is also introduced to speed up convergence of the proposed segmentation model and improve defect segmentation performance. Evaluation and detection experiments are carried out on three public crack segmentation datasets, and comparisons with mainstream segmentation models show that the proposed network achieves excellent performance on pixelwise crack detection.
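The deep supervision mechanism mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the loss used by MST-Net, so everything below is an assumption for illustration: binary cross-entropy as the per-output loss, equal weights, nearest-neighbour upsampling of coarser side outputs, and the hypothetical function names `bce`, `upsample_nearest` and `deep_supervision_loss`.

```python
import numpy as np

def bce(prob, target, eps=1e-7):
    """Mean binary cross-entropy between predicted probabilities and a 0/1 mask."""
    prob = np.clip(prob, eps, 1.0 - eps)  # avoid log(0)
    return float(-np.mean(target * np.log(prob) + (1.0 - target) * np.log(1.0 - prob)))

def upsample_nearest(x, factor):
    """Nearest-neighbour upsampling of a 2-D map by an integer factor."""
    return np.repeat(np.repeat(x, factor, axis=0), factor, axis=1)

def deep_supervision_loss(side_outputs, mask, weights=None):
    """Weighted sum of per-scale losses (assumed form, not the paper's stated loss).

    side_outputs: list of 2-D probability maps at full or reduced resolution.
    mask: full-resolution 0/1 ground-truth crack mask.
    """
    if weights is None:
        weights = [1.0] * len(side_outputs)  # assumption: equal weights
    total = 0.0
    for w, out in zip(weights, side_outputs):
        factor = mask.shape[0] // out.shape[0]  # integer scale gap to the mask
        total += w * bce(upsample_nearest(out, factor), mask)
    return total

# Toy example: an 8x8 mask with one horizontal crack and three side outputs.
mask = np.zeros((8, 8))
mask[3, :] = 1.0
outs = [np.full((8, 8), 0.5), np.full((4, 4), 0.5), np.full((2, 2), 0.5)]
loss = deep_supervision_loss(outs, mask)
```

With every side output contributing its own loss term, gradients reach intermediate layers directly, which is the usual motivation for deep supervision speeding up convergence.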

Pages: 12
