Role of Data Analytics in Infrastructure Asset Management: Overcoming Data Size and Quality Problems

Cited by: 123
Authors
Piryonesi, S. Madeh [1]
El-Diraby, Tamer E. [1]
Affiliations
[1] Univ Toronto, Dept Civil & Mineral Engn, 35 St George St, Toronto, ON M5S 1A4, Canada
Keywords
Machine learning; Ensemble learning; Transportation asset management; Pavement condition index; Highway maintenance; Data preparation; PAVEMENT PERFORMANCE; CRACK INITIATION; ROUGHNESS; MODEL; PREDICTION; REGRESSION; IRI; ANN
DOI
10.1061/JPEODX.0000175
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
This study explores the performance of different classification algorithms when applied to the analysis of asphalt pavement deterioration data. The aim is to examine how different algorithms deal with the typically limited and low-quality data sets in the infrastructure asset management domain, and whether better configurations of the relevant algorithms help overcome these limitations. Furthermore, the emphasis on choosing the most affordable attributes (e.g., temperature and precipitation levels) makes the results reproducible for smaller municipalities. The analysis used data on more than 3,000 road sections retrieved from the Long-Term Pavement Performance (LTPP) database. The algorithms examined include two types of decision trees, the naive Bayes classifier, naive Bayes coupled with kernels, logistic regression, k-nearest neighbors (k-NN), random forest, and gradient boosted trees. All were applied to predict the deterioration of the pavement condition index (PCI). Their performance is compared, and their strengths and weaknesses are discussed. Of specific importance is the positive role of ensemble learning: it is shown how the higher efficiency of ensemble learning can compensate for data shortcomings. The accuracy of some of the models in predicting the PCI after 3 years exceeded 90%. Suggestions are made to improve the performance of some algorithms. For instance, coupling the naive Bayes classifier with kernel estimates is shown to increase its accuracy dramatically. Further, the study examines the impact of data segmentation. Data were divided into four climatic regions. The accuracy of prediction remained sufficiently high after segmentation, with the highest accuracy in the dry, nonfreeze zone and the lowest in the wet, freezing zone.
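The abstract's remark about coupling naive Bayes with kernel estimates can be illustrated with a small sketch. It is not the authors' implementation and does not use LTPP data. The synthetic two-class data set, the fixed bandwidth `h=0.5`, and all function names are assumptions chosen for illustration. Each feature is bimodal within a class, so a single Gaussian per feature fits poorly, while a kernel density estimate can follow the shape.

```python
import math
import random


def make_data(n, rng):
    """Synthetic (hypothetical) data: each feature is bimodal within a class."""
    rows = []
    for _ in range(n):
        label = rng.randrange(2)
        modes = (-2.0, 2.0) if label == 0 else (0.0, 4.0)
        # Features are drawn independently given the class (the naive Bayes assumption).
        x = [rng.choice(modes) + rng.gauss(0.0, 0.4) for _ in range(2)]
        rows.append((x, label))
    return rows


def group_by_class(rows):
    """Map each class label to the list of its training feature vectors."""
    groups = {}
    for x, y in rows:
        groups.setdefault(y, []).append(x)
    return groups


def gaussian_logpdf(v, values):
    """Log density of v under one Gaussian fitted to the training values."""
    mean = sum(values) / len(values)
    var = sum((u - mean) ** 2 for u in values) / len(values)
    return -0.5 * math.log(2 * math.pi * var) - (v - mean) ** 2 / (2 * var)


def kernel_logpdf(v, values, h=0.5):
    """Log density of v under a Gaussian kernel density estimate (bandwidth h assumed)."""
    total = sum(math.exp(-0.5 * ((v - u) / h) ** 2) for u in values)
    density = total / (len(values) * h * math.sqrt(2 * math.pi))
    return math.log(density + 1e-300)  # guard against log(0)


def predict(x, groups, n_train, logpdf):
    """Pick the class maximizing log prior + sum of per-feature log densities."""
    best_label, best_score = None, -math.inf
    for label, members in groups.items():
        score = math.log(len(members) / n_train)
        for j, v in enumerate(x):
            score += logpdf(v, [m[j] for m in members])
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def accuracy(test_rows, groups, n_train, logpdf):
    """Fraction of test rows whose predicted label matches the true label."""
    hits = sum(predict(x, groups, n_train, logpdf) == y for x, y in test_rows)
    return hits / len(test_rows)


rng = random.Random(0)
train, test = make_data(400, rng), make_data(400, rng)
groups = group_by_class(train)
gauss_acc = accuracy(test, groups, len(train), gaussian_logpdf)
kernel_acc = accuracy(test, groups, len(train), kernel_logpdf)
print(f"Gaussian naive Bayes accuracy: {gauss_acc:.3f}")
print(f"Kernel naive Bayes accuracy:   {kernel_acc:.3f}")
```

On data like this, the kernel variant should score clearly higher, because the single-Gaussian fit cannot represent two separate modes per class. How large the gain is on real pavement data depends on the actual feature distributions and on the bandwidth choice, neither of which this record specifies.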
Pages: 15