Model-based clustering (MBC) for road data via multivariate mixture of normal distributions and factor analysis (FA)

被引:2
作者
Yaghoubi, Solmaz [1 ]
Farnoosh, Rahman [2 ]
Behzadi, Mohammad Hassan [1 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Math, Tehran, Iran
关键词
Mixture model; Clustering road data; Multivariate normal; Factor analysis; GAS model; INJURY SEVERITY; REGRESSION; NUMBER;
D O I
10.1080/09720510.2021.2000169
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In road traffic, there are many influential parameters that need to be analysed. For a long time, the statistical methods have been used for analysing, planning and preventing accidents in advanced countries. In this study, multivariate normal distribution was applied to a finite mixture model for clustering roads. In order to solve the problems of road traffic safety and road accidents, a new FA (Factor analysis)-MBC (Model based clustering) method was introduced based on the math theory of FA-MBC. First, the FA method was used to reduce the dimensions of the features, and then our proposed method (MBC) was applied to cluster the road data gathered for Tehran Province, Iran. The results show that our model has a good performance in dealing with data that may have high-dimensional features.
引用
收藏
页数:11
相关论文
共 15 条
[1]   The LINEX Weighted k-Means Clustering [J].
Ahmadzadehgoli, Narges ;
Mohammadpour, Adel ;
Behzadi, Mohammad Hassan .
JOURNAL OF STATISTICAL THEORY AND APPLICATIONS, 2019, 18 (02) :147-154
[2]   Using logistic regression to estimate the influence of accident factors on accident severity [J].
Al-Ghamdi, AS .
ACCIDENT ANALYSIS AND PREVENTION, 2002, 34 (06) :729-741
[3]   Determining the number of factors in approximate factor models [J].
Bai, JS ;
Ng, S .
ECONOMETRICA, 2002, 70 (01) :191-221
[4]   Analysis of traffic injury severity: An application of non-parametric classification tree techniques [J].
Chang, Li-Yen ;
Wang, Hsiu-Wen .
ACCIDENT ANALYSIS AND PREVENTION, 2006, 38 (05) :1019-1027
[5]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[6]   Recovering the number of clusters in data sets with noise features using feature rescaling factors [J].
de Amorim, Renato Cordeiro ;
Hennig, Christian .
INFORMATION SCIENCES, 2015, 324 :126-145
[7]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]  
Harvey A., 2013, Dynamic Models for Volatility and Heavy Tails: With Applications to Financial and Economic Time Series
[9]  
HURVICH CM, 1989, BIOMETRIKA, V76, P297, DOI 10.2307/2336663
[10]   Bank Business Models at Zero Interest Rates [J].
Lucas, Andre ;
Schaumburg, Julia ;
Schwaab, Bernd .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2019, 37 (03) :542-555