A method for clustering unlabeled BIM objects using entropy and TF-IDF with RDF encoding

被引:15
作者
Ali, Mostafa [1 ]
Mohamed, Yasser [1 ]
机构
[1] Univ Alberta, 1-047 NREF Bldg, Edmonton, AB T6G 2W2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Cluster; Semantic web; RDF; Entropy; BIM; Tf-idf; SEMANTIC SIMILARITY; INFORMATION;
D O I
10.1016/j.aei.2017.06.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Oil and gas projects involve different construction disciplines such as mechanical, structural, and electrical. The current practice in these projects involves creating separate building information models for the different disciplines and compiling them into one model to check for collisions or conflicts. Due to intellectual property, contractual requirements, unfinished design, or technical issues during final model compilation, the final merged model lacks essential data for contractors such as the trade of each object in the model. Nonetheless, the model is issued to contractors who utilize it in different pre-construction planning tasks. However, due to data loss, incompleteness, or inconsistency, the model usability can become limited and the contractor has to review the model manually to extract information from it. This is a lengthy and costly task that becomes more challenging in fast-tracked projects that involve periodically issuing updated Building Information Models. One type of information that contractors need for different planning and estimation purposes is the scope of work for different construction trades in different areas of the project. In many cases, models lack explicit attributes of 3D objects that make it possible to perform an automated query of these objects by trade type. This research suggests a state of the art solution to automate the extraction of this information in such cases. In this paper, we describe a method that utilizes Resource Description Framework (RDF) encoding of BIM data together with Term Frequency-Inverse Document Frequency (TF-IDF) and entropy-based algorithms to automatically group 3D objects based on their trade. The proposed methodology is tested using three actual cases of oil and gas projects with more than four million objects in total. The results show that the proposed method can achieve a 91% purity in the generated groups. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:154 / 163
页数:10
相关论文
共 47 条
  • [1] An information-theoretic perspective of tf-idf measures
    Aizawa, A
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (01) : 45 - 65
  • [2] Allemang D., 2011, SEMANTIC WEB WORKING, V2nd ed., P249, DOI [10.1016/B978-0-12-385965-5.10012-3, DOI 10.1016/B978-0-12-385965-5.10012-3]
  • [3] Factors for effective BIM governance
    Alreshidi, Eissa
    Mourshed, Monjur
    Rezgui, Yacine
    [J]. JOURNAL OF BUILDING ENGINEERING, 2017, 10 : 89 - 101
  • [4] [Anonymous], 143 STANF U
  • [5] [Anonymous], SEMANTIC WEB RES APP
  • [6] [Anonymous], DOTNETRDF PROJ
  • [7] [Anonymous], IFCOWL WELC BUILDING
  • [8] [Anonymous], ICWSM
  • [9] [Anonymous], 2008, Introduction to information retrieval
  • [10] [Anonymous], HDB GUIDE BUILDING I