CubiCasa5K: A Dataset and an Improved Multi-task Model for Floorplan Image Analysis

被引:43
作者
Kalervo, Ahti [1 ]
Ylioinas, Juha [1 ]
Haikio, Markus [2 ]
Karhu, Antti [2 ]
Kannala, Juho [1 ]
机构
[1] Aalto Univ, Dept Comp Sci, Espoo, Finland
[2] CubiCasa Inc, Oulu, Finland
来源
IMAGE ANALYSIS | 2019年 / 11482卷
关键词
Floorplan images; Dataset; Convolutional neural networks; Multi-task learning;
D O I
10.1007/978-3-030-20205-7_3
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Better understanding and modelling of building interiors and the emergence of more impressive AR/VR technology has brought up the need for automatic parsing of floorplan images. However, there is a clear lack of representative datasets to investigate the problem further. To address this shortcoming, this paper presents a novel image dataset called CubiCasa5K, a large-scale floorplan image dataset containing 5000 samples annotated into over 80 floorplan object categories. The dataset annotations are performed in a dense and versatile manner by using polygons for separating the different objects. Diverging from the classical approaches based on strong heuristics and low-level pixel operations, we present a method relying on an improved multi-task convolutional neural network. By releasing the novel dataset and our implementations, this study significantly boosts the research on automatic floorplan image analysis as it provides a richer set of tools for investigating the problem in a more comprehensive manner. Data and code at: https://github.com/CubiCasa/CubiCasa5k.
引用
收藏
页码:28 / 40
页数:13
相关论文
共 20 条
  • [1] Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN plus
    Acuna, David
    Ling, Huan
    Kar, Amlan
    Fidler, Sanja
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 859 - 868
  • [2] 2D Human Pose Estimation: New Benchmark and State of the Art Analysis
    Andriluka, Mykhaylo
    Pishchulin, Leonid
    Gehler, Peter
    Schiele, Bernt
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3686 - 3693
  • [3] [Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.90
  • [4] Human Pose Estimation via Convolutional Part Heatmap Regression
    Bulat, Adrian
    Tzimiropoulos, Georgios
    [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 717 - 732
  • [5] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [6] CVC-FP and SGT: a new database for structural floor plan analysis and its groundtruthing tool
    de las Heras, Lluis-Pere
    Terrades, Oriol Ramos
    Robles, Sergi
    Sanchez, Gemma
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (01) : 15 - 30
  • [7] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [8] Dodge S, 2017, PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, P358, DOI 10.23919/MVA.2017.7986875
  • [9] Hinton G., 2015, NIPS
  • [10] Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
    Kendall, Alex
    Gal, Yarin
    Cipolla, Roberto
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7482 - 7491