Cost-Efficient Image Semantic Segmentation for Indoor Scene Understanding Using Weakly Supervised Learning and BIM

Cited by: 8
Authors
Yang, Liu [1]
Cai, Hubo [2]
Affiliations
[1] Purdue Univ, Lyles Sch Civil Engn, Div Construction Engn & Management, 550 Stadium Mall Dr, W Lafayette, IN 47907 USA
[2] Purdue Univ, Lyles Sch Civil Engn, Div Construction Engn & Management, 550 Stadium Mall Dr, W Lafayette, IN 47907 USA
Keywords
Weakly supervised learning; Image-level labels; Semantic segmentation; Building information modeling (BIM); Constrained loss; Deep learning; DEEP; RECOGNITION; NETWORK; MODEL;
DOI
10.1061/JCCEE5.CPENG-5065
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Image segmentation is an essential step in vision sensing and image processing. It enables the understanding of object classes, spatial locations, and extents in a scene, which can support a wide range of construction applications such as progress monitoring, safety management, and productivity analysis. The recent ground-breaking achievements of deep learning-based approaches for semantic segmentation come at the cost of expensive large-scale training datasets annotated at the pixel level. Although building information modeling (BIM) has been leveraged to alleviate labeling costs by using automatically generated, color-coded images as semantic labels, the differences between BIM models and real-world scenes make it difficult to apply networks trained on BIM-generated labels to real images. Furthermore, reducing such differences takes nontrivial effort. To address these problems, this paper proposes a weakly supervised segmentation approach that uses inexpensive image-level labels. The boundary information missing from image-level labels is compensated for by BIM-extracted object information. The proposed method consists of three modules: (1) detect initial object locations from image-level labels; (2) extract object information from BIM as prior knowledge; and (3) incorporate the prior knowledge into the network to enhance the detected object locations. Three extensive experiments are designed to evaluate the effectiveness of the proposed method. Results show that the proposed method substantially improves the detected object areas by using prior knowledge of target objects from BIM and outperforms state-of-the-art weakly supervised methods.
Pages: 15