A novel transformer-based semantic segmentation framework for structural condition assessment

被引:17
作者
Wang, Ruhua [1 ]
Shao, Yanda [2 ]
Li, Qilin [2 ,3 ]
Li, Ling [2 ]
Li, Jun [1 ,4 ]
Hao, Hong [1 ]
机构
[1] Curtin Univ, Sch Civil & Mech Engn, Ctr Infrastruct Monitoring & Protect, Bentley, WA, Australia
[2] Curtin Univ, Sch Elect Engn Comp & Math Sci, Discipline Comp, Bentley, WA, Australia
[3] Curtin Univ, Sch Elect Engn Comp & Math Sci, Discipline Comp, Kent St, Bentley, WA 6102, Australia
[4] Curtin Univ, Sch Civil & Mech Engn, Ctr Infrastruct Monitoring & Protect, Kent St, Bentley, WA 6102, Australia
来源
STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL | 2024年 / 23卷 / 02期
关键词
Semantic segmentation; structural health monitoring; transformer; deep learning; condition assessment; DAMAGE DETECTION;
D O I
10.1177/14759217231182303
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Conventional structural health monitoring (SHM) evaluates the condition of civil structures by analyzing the data acquired by advanced sensors. The requirement of overinvestment in specialized equipment and labor for implementation prevents the traditional SHM from large-scale usage. On the other hand, computer vision techniques offer cost-effective solutions for SHM thanks to its inherent advantage in data acquirement and processing. More importantly, it has been demonstrated that these emerging solutions can produce reliable condition diagnoses for civil structures using pure image data. In this article, a novel transformer-based neural network is proposed for vision-based structural condition assessment which is formulated to a semantic segmentation problem. The network employs Swin Transformer as the backbone and MaskFormer as the overall architecture to recognize components (sleepers, slabs, columns, etc.) and damage (concrete damage, exposed rebar) of structures. Unlike the commonly used fully convolutional networks, the proposed model tackles semantic segmentation as a mask classification rather than a pixel classification problem. To deal with the lack of training data, an image data augmentation method called Copy-Paste is extended and applied for training data generation, resulting in an increase of around 40% data for component segmentation and 71% data for damage segmentation. Experimental validations on the Tokaido railway viaduct dataset show that the proposed approach is very accurate, achieving 97% and 90% mean Intersection Over Union for component and damage segmentation, outperforming the existing methods by a significant margin. The accurate segmentation results can provide meaningful information for downstream SHM tasks.
引用
收藏
页码:1170 / 1183
页数:14
相关论文
共 57 条
[1]   A review of vibration-based damage detection in civil structures: From traditional methods to Machine Learning and Deep Learning applications [J].
Avci, Onur ;
Abdeljaber, Osama ;
Kiranyaz, Serkan ;
Hussein, Mohammed ;
Gabbouj, Moncef ;
Inman, Daniel J. .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2021, 147
[2]   Structural health monitoring using extremely compressed data through deep learning [J].
Azimi, Mohsen ;
Pekcan, Gokhan .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2020, 35 (06) :597-614
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   Engineering deep learning methods on automatic detection of damage in infrastructure due to extreme events [J].
Bai, Yongsheng ;
Zha, Bing ;
Sezen, Halil ;
Yilmaz, Alper .
STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2023, 22 (01) :338-352
[5]  
Brown TB, 2020, ADV NEUR IN, V33
[6]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[7]   Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks [J].
Cha, Young-Jin ;
Choi, Wooram ;
Buyukozturk, Oral .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2017, 32 (05) :361-378
[8]  
Cheng B, 2021, ADV NEUR IN, V34
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]  
Devlin J., 2018, ARXIV