A Hybrid Approach for Pavement Crack Detection Using Mask R-CNN and Vision Transformer Model

被引:0
作者
Alshawabkeh, Shorouq [1 ]
Wu, Li [1 ]
Dong, Daojun [1 ]
Cheng, Yao [1 ]
Li, Liping [1 ]
机构
[1] China Univ Geosci, Fac Engn, Wuhan 430074, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2025年 / 82卷 / 01期
关键词
Pavement crack segmentation; transportation; deep learning; vision transformer; Mask R-CNN; image segmentation;
D O I
10.32604/cmc.2024.057213
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting pavement cracks is critical for road safety and infrastructure management. Traditional methods, relying on manual inspection and basic image processing, are time-consuming and prone to errors. Recent deep-learning (DL) methods automate crack detection, but many still struggle with variable crack patterns and environmental conditions. This study aims to address these limitations by introducing the MaskerTransformer, a novel hybrid deep learning model that integrates the precise localization capabilities of Mask Region-based Convolutional Neural Network (Mask R-CNN) with the global contextual awareness of Vision Transformer (ViT). The research focuses on leveraging the strengths of both architectures to enhance segmentation accuracy and adaptability across different pavement conditions. We evaluated the performance of the MaskerTransformer against other state-of-the(Swin-UNETr), You Only Look Once version 8 (YoloV8), and Mask R-CNN using two benchmark datasets: Crack500 and DeepCrack. The findings reveal that the MaskerTransformer significantly outperforms the existing models, achieving the highest Dice Similarity Coefficient (DSC), precision, recall, and F1-Score across both datasets. Specifically, the model attained a DSC of 80.04% on Crack500 and 91.37% on DeepCrack, demonstrating superior segmentation accuracy and reliability. The high precision and recall rates further substantiate its effectiveness in real-world applications, suggesting that the MaskerTransformer can serve as a robust tool for automated pavement crack detection, potentially replacing more traditional methods.
引用
收藏
页码:561 / 577
页数:17
相关论文
共 31 条
  • [1] Computer vision framework for crack detection of civil infrastructure-A review
    Ai, Dihao
    Jiang, Guiyuan J.
    Lam, Siew-Kei
    He, Peilan
    Li, Chengwu
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [2] Automated Pavement Crack Detection Using Deep Feature Selection and Whale Optimization Algorithm
    Alshawabkeh, Shorouq
    Wu, Li
    Dong, Daojun
    Cheng, Yao
    Li, Liping
    Alanaqreh, Mohammad
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 63 - 77
  • [3] [Anonymous], 2024, Kaggle
  • [4] Unmasking Deception: Empowering Deepfake Detection with Vision Transformer Network
    Arshed, Muhammad Asad
    Alwadain, Ayed
    Ali, Rao Faizan
    Mumtaz, Shahzad
    Ibrahim, Muhammad
    Muneer, Amgad
    [J]. MATHEMATICS, 2023, 11 (17)
  • [5] Improved Pixel-Level Pavement-Defect Segmentation Using a Deep Autoencoder
    Augustauskas, Rytis
    Lipnickas, Arunas
    [J]. SENSORS, 2020, 20 (09)
  • [6] Bharati Puja, 2020, Computational Intelligence in Pattern Recognition. Proceedings of CIPR 2019. Advances in Intelligent Systems and Computing (AISC 999), P657, DOI 10.1007/978-981-13-9042-5_56
  • [7] Review of Pavement Defect Detection Methods
    Cao, Wenming
    Liu, Qifan
    He, Zhiquan
    [J]. IEEE ACCESS, 2020, 8 : 14531 - 14544
  • [8] Deep Learning for Image Enhancement and Correction in Magnetic Resonance Imaging-State-of-the-Art and Challenges
    Chen, Zhaolin
    Pawar, Kamlesh
    Ekanayake, Mevan
    Pain, Cameron
    Zhong, Shenjun
    Egan, Gary F.
    [J]. JOURNAL OF DIGITAL IMAGING, 2023, 36 (01) : 204 - 230
  • [9] Structural health monitoring of asphalt pavements using smart sensor networks: A comprehensive review
    Di Graziano, Alessandro
    Marchetta, Vincenzo
    Cafiso, Salvatore
    [J]. JOURNAL OF TRAFFIC AND TRANSPORTATION ENGINEERING-ENGLISH EDITION, 2020, 7 (05) : 639 - 651
  • [10] Automated Detection and Classification of Pavement Distresses using 3D Pavement Surface Images and Deep Learning
    Ghosh, Rohit
    Smadi, Omar
    [J]. TRANSPORTATION RESEARCH RECORD, 2021, 2675 (09) : 1359 - 1374