An extreme value prediction method based on clustering algorithm

被引:22
作者
Dai, Baorui [1 ]
Xia, Ye [1 ]
Li, Qi [1 ]
机构
[1] Tongji Univ, Dept Bridge Engn, 1239 Siping Rd, Shanghai 200092, Peoples R China
基金
中国国家自然科学基金;
关键词
Random variables; Extreme value; Mixed distribution; Generalized extreme value mixture model; Clustering; Elbow method; IN-MOTION DATA; FREQUENCY-DISTRIBUTION; LOAD; MODEL; RELIABILITY; MIXTURE; SIMULATION; QUANTILES; FATIGUE; MAXIMUM;
D O I
10.1016/j.ress.2022.108442
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Extreme value prediction has been widely applied in many safety-critical scenarios. Due to the influence of mixed types of events, the random variables oftentimes do not comply with the independence and identical distributions. Neglecting the mixed distribution characteristics of these variables may lead to inaccurate extreme value prediction. To solve this problem, this study proposes a novel clustering algorithm based on the generalized extreme value mixture model (GEVMM). The algorithm adaptively classifies the block maximum data into different clusters and synthesizes the clusters according to their weights in the population, thus forming a GEVMM that can predict the maximum values in a given return period. The elbow method combined with root mean squared error (RMSE) and coefficient of determination (R-squared) is used to select the optimal number of clusters to prevent over- and under-fitting the model. Through theoretical examples, the proposed method shows strong applicability to promote the accurate extrapolation of extreme values regardless of overlap among the original mixture components. To demonstrate the practical application of the proposed approach, traffic load effects on bridges based on weight-in-motion data are used to extrapolate extreme values during a specific return period. The process and results show that the developed approach is more reliable for estimating extreme values with mixed probability distribution as compared with existing methods. It also provides a powerful tool for extreme value analysis of mixed distribution data in other fields.
引用
收藏
页数:12
相关论文
共 51 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
[Anonymous], 2003, EUROCODE1 ACT STRU 2
[3]  
Barakat H. M., 2019, Statistical techniques for modelling extreme value data and related applications
[4]  
Bertsekas D.P., 2014, Constrained optimization and Lagrange multiplier methods, DOI DOI 10.1016/C2013-0-10366-2
[5]   Wind Speed Prediction Using a Univariate ARIMA Model and a Multivariate NARX Model [J].
Cadenas, Erasmo ;
Rivera, Wilfrido ;
Campos-Amezcua, Rafael ;
Heard, Christopher .
ENERGIES, 2016, 9 (02)
[6]   Characteristic traffic load effects from a mixture of loading events on short to medium span bridges [J].
Caprani, Colin C. ;
OBrien, Eugene J. ;
McLachlan, Geoff J. .
STRUCTURAL SAFETY, 2008, 30 (05) :394-404
[7]   Long-span bridge traffic loading based on multi-lane traffic micro-simulation [J].
Caprani, Colin C. ;
OBrien, Eugene J. ;
Lipari, Alessandro .
ENGINEERING STRUCTURES, 2016, 115 :207-219
[8]   Conditional Gaussian mixture model for warranty claims forecasting [J].
Chehade, Abdallah ;
Savargaonkar, Mayuresh ;
Krivtsov, Vasiliy .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2022, 218
[9]   Modeling stochastic live load for long-span bridge based on microscopic traffic flow simulation [J].
Chen, S. R. ;
Wu, J. .
COMPUTERS & STRUCTURES, 2011, 89 (9-10) :813-824
[10]   Impact of Road Traffic Tendency in Europe on Fatigue Assessment of Bridges [J].
Croce, Pietro .
APPLIED SCIENCES-BASEL, 2020, 10 (04)