Machine Learning Algorithms for Construction Projects Delay Risk Prediction

被引:186
作者
Gondia, Ahmed [1 ]
Siam, Ahmad [1 ]
El-Dakhakhni, Wael [2 ]
Nassar, Ayman H. [3 ]
机构
[1] McMaster Univ, Dept Civil Engn, 1280 Main St West, Hamilton, ON L8S 4L7, Canada
[2] McMaster Univ, INViSiONLab, Dept Civil Engn, 1280 Main St West, Hamilton, ON L8S 4L7, Canada
[3] German Univ Cairo, Construct & Project Management, Dept Civil Engn, Cairo 11835, Egypt
基金
加拿大自然科学与工程研究理事会;
关键词
Classification; Complex systems; Confusion matrices; Construction projects; Cross validation; Delay risk analysis; Machine learning; Predictive data analytics; Risk identification; Time delay; MONTE-CARLO-SIMULATION; CLASSIFICATION; MANAGEMENT; MODEL; ATTRIBUTES;
D O I
10.1061/(ASCE)CO.1943-7862.0001736
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Projects delays are among the most pressing challenges faced by the construction sector attributed to the sector's complexity and its inherent delay risk sources' interdependence. Machine learning offers an ideal set of techniques capable of tackling such complex systems; however, adopting such techniques within the construction sector remains at an early stage. The goal of this study was to identify and develop machine learning models in order to facilitate accurate project delay risk analysis and prediction using objective data sources. As such, relevant delay risk sources and factors were first identified, and a multivariate data set of previous projects' time performance and delay-inducing risk sources was then compiled. Subsequently, the complexity and interdependence of the system was uncovered through an exploratory data analysis. Accordingly, two suitable machine learning models, utilizing decision tree and naive Bayesian classification algorithms, were identified and trained using the data set for predicting project delay extents. Finally, the predictive performances of both models were evaluated through cross validation tests, and the models were further compared using machine-learning-relevant performance indices. The evaluation results indicated that the naive Bayesian model provides a better predictive performance for the data set examined. Ultimately, the work presented herein harnesses the power of machine learning to facilitate evidence-based decision making, while inherent risk factors are active, interdependent, and dynamic, thus empowering proactive project risk management strategies.
引用
收藏
页数:16
相关论文
共 64 条
[1]  
Aburrous Maher, 2010, Proceedings of the Seventh International Conference on Information Technology: New Generations (ITNG 2010), P176, DOI 10.1109/ITNG.2010.117
[2]  
Aggarwal C. C., 2016, Data mining: the textbook, P285
[3]  
Aibinu A.A., 2002, INT J PROJ MANAG, V20, P593, DOI [10.1016/S0263-7863(02)00028-5, DOI 10.1016/S0263-7863(02)00028-5]
[4]  
Al-Momani A.H., 2000, INT J PROJ MANAG, V18, P51, DOI [10.1016/S0263-7863(98)00060-X, DOI 10.1016/S0263-7863(98)00060-X]
[5]   The impact of contractors' attributes on construction project success: A post construction evaluation [J].
Alzahrani, Jaman I. ;
Emsley, Margaret W. .
INTERNATIONAL JOURNAL OF PROJECT MANAGEMENT, 2013, 31 (02) :313-322
[6]  
Amor N.B., 2004, P ACM S APPL COMP, P420
[7]  
[Anonymous], 2018, R PACKAGE VERSION 41
[8]  
[Anonymous], 2017, A Guide to the SCRUM BODY OF KNOWLEDGE (SBOK TM GUIDE) Third Edition A Comprehensive Guide to Deliver Projects using Scrum Includes two chapters about Scaling Scrum for Large Projects and the Enterprise
[9]  
[Anonymous], 2009, ELEMENTS STAT LEARNI, DOI 10.1007/978-0-387-84858-7
[10]  
[Anonymous], 2017, PROJECT MANAGEMENT S