TransMF: Transformer-Based Multi-Scale Fusion Model for Crack Detection

被引:11
|
作者
Ju, Xiaochen [1 ]
Zhao, Xinxin [1 ]
Qian, Shengsheng [2 ]
机构
[1] China Acad Railway Sci Corp Ltd, Railway Engn Res Inst, Beijing 100081, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100090, Peoples R China
关键词
crack detection; convolutional neural network; transformer; multi-scale fusion;
D O I
10.3390/math10132354
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Cracks are widespread in infrastructure that are closely related to human activity. It is very popular to use artificial intelligence to detect cracks intelligently, which is known as crack detection. The noise in the background of crack images, discontinuity of cracks and other problems make the crack detection task a huge challenge. Although many approaches have been proposed, there are still two challenges: (1) cracks are long and complex in shape, making it difficult to capture long-range continuity; (2) most of the images in the crack dataset have noise, and it is difficult to detect only the cracks and ignore the noise. In this paper, we propose a novel method called Transformer-based Multi-scale Fusion Model (TransMF) for crack detection, including an Encoder Module (EM), Decoder Module (DM) and Fusion Module (FM). The Encoder Module uses a hybrid of convolution blocks and Swin Transformer block to model the long-range dependencies of different parts in a crack image from a local and global perspective. The Decoder Module is designed with symmetrical structure to the Encoder Module. In the Fusion Module, the output in each layer with unique scales of Encoder Module and Decoder Module are fused in the form of convolution, which can release the effect of background noise and strengthen the correlations between relevant context in order to enhance the crack detection. Finally, the output of each layer of the Fusion Module is concatenated to achieve the purpose of crack detection. Extensive experiments on three benchmark datasets (CrackLS315, CRKWH100 and DeepCrack) demonstrate that the proposed TransMF in this paper exceeds the best performance of present baselines.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Multi-scale feature fusion for pavement crack detection based on Transformer
    Yang, Yalong
    Niu, Zhen
    Su, Liangliang
    Xu, Wenjing
    Wang, Yuanhang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (08) : 14920 - 14937
  • [2] Transformer-based multi-scale feature fusion network for remote sensing change detection
    Liang, Shike
    Hua, Zhen
    Li, Jinjiang
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [3] TFNet: Transformer-Based Multi-Scale Feature Fusion Forest Fire Image Detection Network
    Liu, Hongying
    Zhang, Fuquan
    Xu, Yiqing
    Wang, Junling
    Lu, Hong
    Wei, Wei
    Zhu, Jun
    FIRE-SWITZERLAND, 2025, 8 (02):
  • [4] A Road Crack Segmentation Method Based on Transformer and Multi-Scale Feature Fusion
    Xu, Yang
    Xia, Yonghua
    Zhao, Quai
    Yang, Kaihua
    Li, Qiang
    ELECTRONICS, 2024, 13 (12)
  • [5] Transformer-Based Multi-Scale Feature Remote Sensing Image Classification Model
    Sun, Ting
    Li, Jun
    Zhou, Xiangrui
    Chen, Zan
    IEEE ACCESS, 2025, 13 : 34095 - 34104
  • [6] Multi-scale Feature Fusion Object Detection Based on Swin Transformer
    Zhang, Ying
    Wu, Lin
    Deng, Huaxuan
    Hu, Jun
    Li, Xifan
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 1982 - 1987
  • [7] Bridge crack detection method based on multi-scale feature fusion
    Wang, Yubian
    Zou, Chengzheng
    Song, Yajuan
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ALGORITHMS, SOFTWARE ENGINEERING, AND NETWORK SECURITY, ASENS 2024, 2024, : 559 - 562
  • [8] Transformer-based Multi-scale Underwater Image Enhancement Network
    Yang, Ai-Ping
    Fang, Si-Jie
    Shao, Ming-Fu
    Zhang, Teng-Fei
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (12): : 1696 - 1705
  • [9] PlaceFormer: Transformer-Based Visual Place Recognition Using Multi-Scale Patch Selection and Fusion
    Kannan, Shyam Sundar
    Min, Byung-Cheol
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6552 - 6559
  • [10] MULTI-SCALE TRANSFORMER-BASED FEATURE COMBINATION FOR IMAGE RETRIEVAL
    Roig Mari, Carlos
    Varas Gonzalez, David
    Bou-Balust, Elisenda
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3166 - 3170