Language model enhanced surface chloride concentration determination for concrete within splash environment based on limited field records

被引:4
作者
Ma, Xin-Rui [1 ]
Liang, Xiao [1 ]
Wang, Shuai [2 ]
Chen, Shi-Zhi [1 ]
机构
[1] Sch Highway, Xian 710064, Peoples R China
[2] Shaanxi Prov Transport Planning Design & Res Inst, Xian 710065, Peoples R China
关键词
Concrete surface chloride concentration; Splash environment; Semantic information; Language model; Limited field records; DIFFUSION-COEFFICIENT; CORROSION; PREDICTION; REGRESSION; STEEL;
D O I
10.1016/j.cscm.2024.e03157
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Chloride ion is severely harmful to reinforced concrete (RC) structures in marine environments. For maintaining the durability and safety of the designed RC structures, the determination of chloride ion concentration on concrete surfaces is critical. Currently, surface chloride ion concentration can be determined using empirical formulas and machine learning (ML) approaches. However, these approaches only rely on the numerical information within field records, disregarding valuable semantic and background information in the records, leading to low accuracy. Meanwhile, in splash environments, it presents a significant challenge to obtain chloride ion concentration records due to the complex environment and high costs involved. Therefore, based on limited field records of surface ion concentrations in splash environments, and utilizing a stateof-the-art language model (LM), this study proposes an LM-based information generation (LMIG) model to improve the accuracy of determination of surface chloride concentrations on RC structures. This paper utilizes the numerical and semantic information in 70 sets of field records to fine-tune the LMIG model and generates 200 sets of high-quality records. These records are then used to train ML algorithms for predicting chloride ion concentrations on concrete surfaces. After conducting comparative research, it was found that incorporating records generated by the LMIG model significantly enhances the accuracy of the ML algorithm. Specifically, the predictive accuracy using the random forest algorithm increased by 33.1%. Furthermore, this paper also conducts a comparative study on the LMIG model and the generative adversarial network (GAN)assisted data-driven method. The results demonstrate that integrating semantic and numerical information into the LMIG model shows significant advantages in enhancing ML algorithms' accuracy.
引用
收藏
页数:22
相关论文
共 67 条
[1]  
Akiba T, 2019, Arxiv, DOI arXiv:1907.10902
[2]   Machine learning prediction of structural response of steel fiber-reinforced concrete beams subjected to far-field blast loading [J].
Almustafa, Monjee K. ;
Nehdi, Moncef L. .
CEMENT & CONCRETE COMPOSITES, 2022, 126
[3]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[4]  
Borisov V., 2023, Language models are realistic tabular data generators
[5]  
Borisov V, 2023, Arxiv, DOI [arXiv:2210.06280, 10.48550/arXiv.2210.06280]
[6]   Prediction of surface chloride concentration of marine concrete using ensemble machine learning [J].
Cai, Rong ;
Han, Taihao ;
Liao, Wenyu ;
Huang, Jie ;
Li, Dawang ;
Kumar, Aditya ;
Ma, Hongyan .
CEMENT AND CONCRETE RESEARCH, 2020, 136
[7]  
[蔡荣 Cai Rong], 2019, [土木与环境工程学报(中英文), Journal of Civil and Environmental Engineering], V41, P122
[8]  
Cawley GC, 2010, J MACH LEARN RES, V11, P2079
[9]   Effects of W/B ratios and fly ash finenesses on chloride diffusion coefficient of concrete in marine environment [J].
Chalee, W. ;
Jaturapitakkul, C. .
MATERIALS AND STRUCTURES, 2009, 42 (04) :505-514
[10]   Multifidelity approach for data-driven prediction models of structural behaviors with limited data [J].
Chen, Shi-Zhi ;
Feng, De-Cheng .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2022, 37 (12) :1566-1581