Robust crack detection in complex slab track scenarios using STC-YOLO and synthetic data with highly simulated modeling

被引:0
作者
Hu, Wenbo [1 ]
Liu, Xianhua [1 ]
Zhou, Zhizhang [1 ]
Wang, Weidong [2 ]
Wu, Zheng [3 ]
Chen, Zhengwei [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong 999077, Peoples R China
[2] Cent South Univ, Sch Civil Engn, Changsha 410075, Peoples R China
[3] Guangdong Zhuzhao Railway Co Ltd, Guangzhou 510000, Peoples R China
基金
中国国家自然科学基金;
关键词
Image synthesis; Crack detection; Slab track; Virtual modeling; Deep learning;
D O I
10.1016/j.autcon.2025.106219
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Crack detection in slab tracks plays a crucial role in accident prevention. Existing algorithms primarily operate on monotonous concrete backgrounds and often struggle with data scarcity and complex scenes. This paper proposes a parametric slab track model replicating real-world inspection conditions through high-fidelity virtual simulation, enabling realistic synthetic crack data generation. The subsequently developed STC-YOLO network utilizes these synthetic images to enhance fine crack detection in complex slab track scenes. Results show that STC-YOLO trained on synthetic data (4:1 virtual-to-real ratio) achieves over 20 % improvements in both mAP and recall compared to using no virtual images, outperforming traditional augmentation methods like horizontal flipping and color dithering. Moreover, STC-YOLO exhibits over 6 % higher mAP than the baseline algorithm and surpasses five state-of-the-art object detection networks. The proposed algorithm greatly reduces the cost of data acquisition.
引用
收藏
页数:17
相关论文
共 56 条
  • [21] SAFFNet: Self-Attention-Based Feature Fusion Network for Remote Sensing Few-Shot Scene Classification
    Kim, Joseph
    Chi, Mingmin
    [J]. REMOTE SENSING, 2021, 13 (13)
  • [22] Cross-scene pavement distress detection by a novel transfer learning framework
    Li, Yishun
    Che, Pengyu
    Liu, Chenglong
    Wu, Difei
    Du, Yuchuan
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2021, 36 (11) : 1398 - 1415
  • [23] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Liu, Ze
    Lin, Yutong
    Cao, Yue
    Hu, Han
    Wei, Yixuan
    Zhang, Zheng
    Lin, Stephen
    Guo, Baining
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
  • [24] Generative adversarial network for road damage detection
    Maeda, Hiroya
    Kashiyama, Takehiro
    Sekimoto, Yoshihide
    Seto, Toshikazu
    Omata, Hiroshi
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2021, 36 (01) : 47 - 60
  • [25] Realtime conversion of cracks from pixel to engineering scale using Augmented Reality
    Malek, Kaveh
    Moreu, Fernando
    [J]. AUTOMATION IN CONSTRUCTION, 2022, 143
  • [26] A cost effective solution for pavement crack inspection using cameras and deep neural networks
    Mei, Qipei
    Gul, Mustafa
    [J]. CONSTRUCTION AND BUILDING MATERIALS, 2020, 256 (256)
  • [27] Balanced single-shot object detection using cross-context attention-guided network
    Miao, Shuyu
    Du, Shanshan
    Feng, Rui
    Zhang, Yuejie
    Li, Huayu
    Liu, Tianbi
    Zheng, Lin
    Fan, Weiguo
    [J]. PATTERN RECOGNITION, 2022, 122
  • [28] Synthetic data generation using finite element method to pre-train an image segmentation model for defect detection using infrared thermography
    Pareek, Kaushal Arun
    May, Daniel
    Meszmer, Peter
    Ras, Mohamad Abo
    Wunderle, Bernhard
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2025, 36 (03) : 1879 - 1905
  • [29] Virtual generation of pavement crack images based on improved deep convolutional generative adversarial network
    Pei, Lili
    Sun, Zhaoyun
    Xiao, Liyang
    Li, Wei
    Sun, Jing
    Zhang, He
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 104
  • [30] Multi-instance attention network for few-shot learning
    Qin, Zhili
    Wang, Han
    Mawuli, Cobbinah Bernard
    Han, Wei
    Zhang, Rui
    Yang, Qinli
    Shao, Junming
    [J]. INFORMATION SCIENCES, 2022, 611 : 464 - 475