Deep Learning Based Semantic Segmentation for BIM Model Generation from RGB-D Sensors

被引：1

作者：

Rached, Ishraq ^{[1
]}

Hajji, Rafika ^{[1
]}

Landes, Tania ^{[2
]}

Haffadi, Rashid ^{[3
]}

机构：

[1] Inst Agron & Vet Med, Coll Geomat Sci & Surveying Engn, Rabat 6202, Morocco

[2] Natl Inst Appl Sci INSA Strasbourg, Photogrammetry & Geomat Grp, ICube Lab UMR 7357, 24 Blvd Victoire, F-67084 Strasbourg, France

[3] GEOPTIMA, B4,Med El Amraoui St,Corner Sebou St,Off 4, Kenitra, Morocco

来源：

19TH 3D GEOINFO CONFERENCE 2024, VOL. 10-4 | 2024年

关键词：

RGB-D Camera; Semantic Segmentation; Deep Learning; As-built BIM; TOOL;

D O I：

10.5194/isprs-annals-X-4-W5-2024-271-2024

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

RGB-D sensors offer a low-cost and promising solution to streamline the generation of BIM models. This paper introduces a framework designed to automate the creation of detailed and semantically rich BIM models from RGB-D data in indoor environments. The framework leverages advanced computer vision and deep learning techniques to overcome the challenges associated with traditional, labour-intensive BIM modeling methods. The results show that the proposed method is robust and accurate, compared to the high-quality statistic laser scanning TLS. Indeed, 58% of the distances measured between the calculated and the reference point cloud produced by TLS were under 5 cm, and 82% of distances were smaller than 7 cm. Furthermore, the framework achieves 100% accuracy in element extraction. Beyond its accuracy, the proposed framework significantly enhances efficiency in both data acquisition and processing. In contrast to the time-consuming process associated with TLS, our approach remarkably reduces the data collection and processing time by factor of height.This highlights the framework's substantial improvements in accuracy and efficiency throughout the BIM generation workflows, making it a streamlined and time-effective solution.

引用

页码：271 / 279

页数：9

共 38 条

[1] Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks [J].

Barchid, Sami ;

Mennesson, Jose ;

Djeraba, Chaabane .

2021 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2021, :199-202

[2] Speeded-Up Robust Features (SURF) [J].

Bay, Herbert ;

Ess, Andreas ;

Tuytelaars, Tinne ;

Van Gool, Luc .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) :346-359

[3] Calibrate Multiple Consumer RGB-D Cameras for Low-Cost and Efficient 3D Indoor Mapping [J].

Chen, Chi ;

Yang, Bisheng ;

Song, Shuang ;

Tian, Mao ;

Li, Jianping ;

Dai, Wenxia ;

Fang, Lina .

REMOTE SENSING, 2018, 10 (02)

[4] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[5] OBJECT MODELING BY REGISTRATION OF MULTIPLE RANGE IMAGES [J].

CHEN, Y ;

MEDIONI, G .

IMAGE AND VISION COMPUTING, 1992, 10 (03) :145-155

[6] State-of-the-Art Review on Mixed Reality Applications in the AECO Industry [J].

Cheng, Jack C. P. ;

Chen, Keyu ;

Chen, Weiwei .

JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2020, 146 (02)

[7] Xception: Deep Learning with Depthwise Separable Convolutions [J].

Chollet, Francois .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807

[8]

Dey E. K., 2021, 2021 Digital Image Computing: Techniques and Applications (DICTA), P1

[9] Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans [J].

Eftekhar, Ainaz ;

Sax, Alexander ;

Malik, Jitendra ;

Zamir, Amir .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :10766-10776

[10] RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY [J].

FISCHLER, MA ;

BOLLES, RC .

COMMUNICATIONS OF THE ACM, 1981, 24 (06) :381-395

← 1 2 3 4 →