External Attention Based TransUNet and Label Expansion Strategy for Crack Detection

被引:38
作者
Fang, Jie [1 ,2 ]
Yang, Chen [3 ]
Shi, Yuetian [4 ,5 ]
Wang, Nan [4 ,5 ]
Zhao, Yang [6 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Telecommun & Informat Engn, Xian 710121, Shaanxi, Peoples R China
[2] Corp Shaanxi Wukong Clouds Informat & Technol, Xian 710000, Shaanxi, Peoples R China
[3] Minist Sci & Technol, Pudong Dev Bank, Xian 710065, Shaanxi, Peoples R China
[4] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Key Lab Spectral Imaging Technol CAS, Xian 710119, Shaanxi, Peoples R China
[5] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[6] Changan Univ, Coll Transportat Engn, Xian 710064, Shaanxi, Peoples R China
关键词
Feature extraction; Transformers; Roads; Mathematical models; Deep learning; Convolution; Semantics; Crack detection; TransUNet; external attention; label expansion; ALGORITHM; IMAGE;
D O I
10.1109/TITS.2022.3154407
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Crack detection is an indispensable premise of road maintenance, which can provide early warning information for many road damages and save repair costs to a large extent. Because of the security and convenience, many image processing technique (IPT) based crack detection methods have been proposed, but their performances often cannot meet the requirements of practical applications because of the complex texture structure and seriously imbalanced categories. To address the aforementioned problem, we present an external attention based TransUNet for crack detection. Specifically, we tackle the TransUNet as the backbone of our detection framework, which can propagate the detailed texture information from shallow layers to corresponding deep layers through skip connections. Besides, the Transformer Block equipped in the second last convolution layer of the encoding component can explicitly model the long-range dependency of different regions in an image, which improves the structural representation ability of the framework and hence alleviates the interference from shadow, noise, and other negative factors. In addition, the External Attention Block equipped in the last convolution layer of the encoding component can effectively exploit the dependency of crack regions among different images, and further enhance the robustness of the framework. Finally, combined with the Focal Loss, the proposed label expansion strategy can further alleviate the category imbalance problem through transforming semantic categories of non-crack pixels distributed in the neighbors of corresponding crack pixels.
引用
收藏
页码:19054 / 19063
页数:10
相关论文
共 41 条
  • [1] Analysis of edge-detection techniques for crack identification in bridges
    Abdel-Qader, L
    Abudayyeh, O
    Kelly, ME
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2003, 17 (04) : 255 - 263
  • [2] Automatic Crack Detection on Two-Dimensional Pavement Images: An Algorithm Based on Minimal Path Selection
    Amhaz, Rabih
    Chambon, Sylvie
    Idier, Jerome
    Baltazart, Vincent
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (10) : 2718 - 2729
  • [3] NB-CNN: Deep Learning-Based Crack Detection Using Convolutional Neural Network and Naive Bayes Data Fusion
    Chen, Fu-Chen
    Jahanshahi, Mohammad R.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2018, 65 (05) : 4392 - 4400
  • [4] Chen J, ARXIV210204306, V2021
  • [5] Deep Category-Level and Regularized Hashing With Global Semantic Similarity Learning
    Chen, Yaxiong
    Lu, Xiaoqiang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (12) : 6240 - 6252
  • [6] Cord A, 2012, COMPUT-AIDED CIV INF, V27, P244, DOI [10.1111/j.1467-8667.2011-00736.x, 10.1111/j.1467-8667.2011.00736.x]
  • [7] Delagnes P, 1995, INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOLS I-III, pA446
  • [8] Fang J., 2022, IEEE GEOSCI REMOTE S, V19, P1
  • [9] Distribution equalization learning mechanism for road crack detection
    Fang, Jie
    Qu, Bo
    Yuan, Yuan
    [J]. NEUROCOMPUTING, 2021, 424 : 193 - 204
  • [10] Deep Convolutional Neural Networks with transfer learning for computer vision-based data-driven pavement distress detection
    Gopalakrishnan, Kasthurirangan
    Khaitan, Siddhartha K.
    Choudhary, Alok
    Agrawal, Ankit
    [J]. CONSTRUCTION AND BUILDING MATERIALS, 2017, 157 : 322 - 330