Automated Requirements Identification from Construction Contract Documents Using Natural Language Processing

被引:82
作者
Hassan, Fahad Ul [1 ]
Le, Tuyen [1 ]
机构
[1] Clemson Univ, Glenn Dept Civil Engn, Clemson, SC 29634 USA
关键词
Construction contracts; Scope comprehension; Project requirements; Natural language processing; Machine learning; Text classification; Project management; CLASSIFICATION; INDUSTRY;
D O I
10.1061/(ASCE)LA.1943-4170.0000379
中图分类号
D9 [法律]; DF [法律];
学科分类号
0301 ;
摘要
Contract documents are a critical legal component of a construction project that specify all wishes and expectations of the owner toward the design, construction, and handover of a project. Precise comprehension of the contract documents is critical to ensure that all important contractual requirements of the project scope are captured and managed. A contract package typically includes both requirements and other unimportant texts such as instructions and supporting statements; thus, practitioners are required to read and identify texts indicating the requirements. The conventional manual practice of scope comprehension requires much time and effort and may include human errors. Little attention has been paid toward automated identification of requirement texts. This study introduces an effective way to identify contractual requirements by developing an automated framework using natural language processing (NLP) and machine learning techniques. Four different machine learning algorithms, namely Naive Bayes, support vector machines, logistic regression, and feedforward neural network were used to develop the classification models. The models classified the contractual text into requirement and nonrequirement text. Experiments showed that the support vector machine model outperforms the other models in terms of accuracy, precision, recall, and F1-score. In addition, unigrams yield better results than higher n-gram features. An experimental study including human participants further proves that the developed model is efficient and effective that can help reduce reading time and improve contract scope comprehension.
引用
收藏
页数:12
相关论文
共 54 条
[1]  
Abu Sheikha Fadi, 2010, 2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE 2010), DOI 10.1109/NLPKE.2010.5587767
[2]  
Balakrishnan Vimala, 2014, Lecture Notes on Software Engineering, V2, P262, DOI 10.7763/LNSE.2014.V2.134
[3]  
Berenbach Brian, 2010, 2010 Third International Workshop on Requirements Engineering and Law (RELAW 2010), P27, DOI 10.1109/RELAW.2010.5625354
[4]   Big Data in the construction industry: A review of present status, opportunities, and future trends [J].
Bilal, Muhammad ;
Oyedele, Lukumon O. ;
Qadir, Junaid ;
Munir, Kamran ;
Ajayi, Saheed O. ;
Akinade, Olugbenga O. ;
Owolabi, Hakeem A. ;
Alaka, Hafiz A. ;
Pasha, Maruf .
ADVANCED ENGINEERING INFORMATICS, 2016, 30 (03) :500-521
[5]  
Boulis C., 2005, P INT WORKSH FEAT SE, P9
[6]  
Breaux TD, 2006, RE'06: 14TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE, PROCEEDINGS, P49
[7]  
BUCKLAND M, 1994, J AM SOC INFORM SCI, V45, P12, DOI 10.1002/(SICI)1097-4571(199401)45:1<12::AID-ASI2>3.0.CO
[8]  
2-L
[9]  
Bunni N.G., 2003, Risk and Insurance in Construction, VSecond
[10]  
Chalkidis I, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P254