Robust crack detection in complex slab track scenarios using STC-YOLO and synthetic data with highly simulated modeling

被引：0

作者：

Hu, Wenbo ^{[1
]}

Liu, Xianhua ^{[1
]}

Zhou, Zhizhang ^{[1
]}

Wang, Weidong ^{[2
]}

Wu, Zheng ^{[3
]}

Chen, Zhengwei ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong 999077, Peoples R China

[2] Cent South Univ, Sch Civil Engn, Changsha 410075, Peoples R China

[3] Guangdong Zhuzhao Railway Co Ltd, Guangzhou 510000, Peoples R China

来源：

AUTOMATION IN CONSTRUCTION | 2025年 / 175卷

基金：

中国国家自然科学基金;

关键词：

Image synthesis; Crack detection; Slab track; Virtual modeling; Deep learning;

D O I：

10.1016/j.autcon.2025.106219

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Crack detection in slab tracks plays a crucial role in accident prevention. Existing algorithms primarily operate on monotonous concrete backgrounds and often struggle with data scarcity and complex scenes. This paper proposes a parametric slab track model replicating real-world inspection conditions through high-fidelity virtual simulation, enabling realistic synthetic crack data generation. The subsequently developed STC-YOLO network utilizes these synthetic images to enhance fine crack detection in complex slab track scenes. Results show that STC-YOLO trained on synthetic data (4:1 virtual-to-real ratio) achieves over 20 % improvements in both mAP and recall compared to using no virtual images, outperforming traditional augmentation methods like horizontal flipping and color dithering. Moreover, STC-YOLO exhibits over 6 % higher mAP than the baseline algorithm and surpasses five state-of-the-art object detection networks. The proposed algorithm greatly reduces the cost of data acquisition.

引用

页数：17

共 56 条

[21] SAFFNet: Self-Attention-Based Feature Fusion Network for Remote Sensing Few-Shot Scene Classification
Kim, Joseph
Chi, Mingmin
[J]. REMOTE SENSING, 2021, 13 (13)
[22] Cross-scene pavement distress detection by a novel transfer learning framework
Li, Yishun
Che, Pengyu
Liu, Chenglong
Wu, Difei
Du, Yuchuan
[J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2021, 36 (11) : 1398 - 1415
[23] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Liu, Ze
Lin, Yutong
Cao, Yue
Hu, Han
Wei, Yixuan
Zhang, Zheng
Lin, Stephen
Guo, Baining
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
[24] Generative adversarial network for road damage detection
Maeda, Hiroya
Kashiyama, Takehiro
Sekimoto, Yoshihide
Seto, Toshikazu
Omata, Hiroshi
[J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2021, 36 (01) : 47 - 60
[25] Realtime conversion of cracks from pixel to engineering scale using Augmented Reality
Malek, Kaveh
Moreu, Fernando
[J]. AUTOMATION IN CONSTRUCTION, 2022, 143
[26] A cost effective solution for pavement crack inspection using cameras and deep neural networks
Mei, Qipei
Gul, Mustafa
[J]. CONSTRUCTION AND BUILDING MATERIALS, 2020, 256 (256)
[27] Balanced single-shot object detection using cross-context attention-guided network
Miao, Shuyu
Du, Shanshan
Feng, Rui
Zhang, Yuejie
Li, Huayu
Liu, Tianbi
Zheng, Lin
Fan, Weiguo
[J]. PATTERN RECOGNITION, 2022, 122
[28] Synthetic data generation using finite element method to pre-train an image segmentation model for defect detection using infrared thermography
Pareek, Kaushal Arun
May, Daniel
Meszmer, Peter
Ras, Mohamad Abo
Wunderle, Bernhard
[J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2025, 36 (03) : 1879 - 1905
[29] Virtual generation of pavement crack images based on improved deep convolutional generative adversarial network
Pei, Lili
Sun, Zhaoyun
Xiao, Liyang
Li, Wei
Sun, Jing
Zhang, He
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 104
[30] Multi-instance attention network for few-shot learning
Qin, Zhili
Wang, Han
Mawuli, Cobbinah Bernard
Han, Wei
Zhang, Rui
Yang, Qinli
Shao, Junming
[J]. INFORMATION SCIENCES, 2022, 611 : 464 - 475

← 1 2 3 4 5 6 →