Cost-Efficient Image Semantic Segmentation for Indoor Scene Understanding Using Weakly Supervised Learning and BIM

被引：8

作者：

Yang, Liu ^{[1
]}

Cai, Hubo ^{[2
]}

机构：

[1] Purdue Univ, Lyles Sch Civil Engn, Div Construction Engn & Management, 550 Stadium Mall Dr, W Lafayette, IN 47907 USA

[2] Purdue Univ, Lyles Sch Civil Engn, Div Construction Engn & Management, 550 Stadium Mall Dr, W Lafayette, IN 47907 USA

来源：

JOURNAL OF COMPUTING IN CIVIL ENGINEERING | 2023年 / 37卷 / 02期

关键词：

Weakly supervised learning; Image-level labels; Semantic segmentation; Building information modeling (BIM); Constrained loss; Deep learning; DEEP; RECOGNITION; NETWORK; MODEL;

D O I：

10.1061/JCCEE5.CPENG-5065

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Image segmentation is an essential step in vision sensing and image processing. It enables the understanding of the object's classes, spatial locations, and extents in the scene, which can be used to support a wide range of construction applications such as progress monitoring, safety management, and productivity analysis. The recent ground-breaking achievements of deep learning-based approaches for semantic segmentation are at the cost of expensive large-scale training datasets annotated at the pixel level. Although building information modeling (BIM) has been leveraged to alleviate labeling costs using automatically generated, color-coded images as semantic labels, the differences between the BIM models and the real-world scenes make it difficult to apply networks trained on BIM-generated labels to real images. Furthermore, it takes nontrivial efforts to reduce such differences. To address these problems, this paper proposes a weakly supervised segmentation approach that uses inexpensive image-level labels. The missing boundary information in image-level labels is compensated by BIM-extracted object information. The proposed method consists of three modules: (1) detect initial object locations from image-level labels; (2) extract object information from BIM as prior knowledge; and (3) incorporate the prior knowledge into the network to enhance the detected object locations. Three extensive experiments are designed to evaluate the effectiveness of the proposed method. Results show that the proposed method substantially improves the detected object areas by using prior knowledge of target objects from BIM and outperforms the state-of-the-art weakly supervised methods.

引用

页数：15

共 101 条

[1] Acharya D., 2019, Tech. Rep
[2] Single-image localisation using 3D models: Combining hierarchical edge maps and semantic segmentation for domain adaptation
Acharya, Debaditya
Tennakoon, Ruwan
Muthu, Sundaram
Khoshelham, Kourosh
Hoseinnezhad, Reza
Bab-Hadiashar, Alireza
[J]. AUTOMATION IN CONSTRUCTION, 2022, 136
[3] BIM-PoseNet: Indoor camera localisation using a 3D indoor model and deep learning from synthetic images
Acharya, Debaditya
Khoshelham, Kourosh
Winter, Stephan
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 150 : 245 - 258
[4] Learning Pixel-level Semantic Affinity with Image-level Supervision forWeakly Supervised Semantic Segmentation
Ahn, Jiwoon
Kwak, Suha
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4981 - 4990
[5] Al Qurishee Murad, 2020, Engineering, V12, P151, DOI DOI 10.4236/ENG.2020.123013
[6] Alawadlhi M., 2020, PROC 40 ANN C ASS CO, P228
[7] Alvares J.S., 2019, P 27 ANN C INT GRUP, P1445, DOI DOI 10.24928/2019/0165
[8] Dataset and benchmark for detecting moving objects in construction sites
An Xuehui
Zhou Li
Liu Zuguang
Wang Chengzhi
Li Pengfei
Li Zhiwei
[J]. AUTOMATION IN CONSTRUCTION, 2021, 122 (122)
[9] [Anonymous], 2011, P ADV NEUR INF PROC
[10] What's the Point: Semantic Segmentation with Point Supervision
Bearman, Amy
Russakovsky, Olga
Ferrari, Vittorio
Fei-Fei, Li
[J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 549 - 565

← 1 2 3 4 5 6 7 8 9 10 →