Estimating Generic 3D Room Structures from 2D Annotations

Cited by: 0
Authors
Rozumnyi, Denys [2, 4]
Popov, Stefan [1]
Maninis, Kevis-Kokitsi [1]
Niessner, Matthias [3]
Ferrari, Vittorio [1]
Affiliations
[1] Google Res, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Zurich, Switzerland
[3] Tech Univ Munich, Munich, Germany
[4] Google, Zurich, Switzerland
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Indoor rooms are among the most common use cases in 3D scene understanding. Current state-of-the-art methods for this task are driven by large annotated datasets. Room layouts are especially important, consisting of structural elements in 3D, such as wall, floor, and ceiling. However, they are difficult to annotate, especially on pure RGB video. We propose a novel method to produce generic 3D room layouts just from 2D segmentation masks, which are easy to annotate for humans. Based on these 2D annotations, we automatically reconstruct 3D plane equations for the structural elements and their spatial extent in the scene, and connect adjacent elements at the appropriate contact edges. We annotate and publicly release 2246 3D room layouts on the RealEstate10k dataset, containing YouTube videos. We demonstrate the high quality of these 3D layout annotations with extensive experiments.
Pages: 13
Related papers
50 in total
  • [1] Estimating Generic 3D Room Structures from 2D Annotations
    Rozumnyi, Denys
    Popov, Stefan
    Maninis, Kevis-Kokitsi
    Nießner, Matthias
    Ferrari, Vittorio
    Advances in Neural Information Processing Systems, 2023, 36
  • [2] Learning to Segment 3D Linear Structures Using Only 2D Annotations
    Kozinski, Mateusz
    Mosinska, Agata
    Salzmann, Mathieu
    Fua, Pascal
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2018, PT II, 2018, 11071 : 283 - 291
  • [3] Interactive 3D Character Modeling from 2D Orthogonal Drawings with Annotations
    Huang, Zhengyu
    Xie, Haoran
    Fukusato, Tsukasa
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
  • [4] Interpreting 2D Gesture Annotations in 3D Augmented Reality
    Nuernberger, Benjamin
    Lien, Kuo-Chin
    Hollerer, Tobias
    Turk, Matthew
    2016 IEEE SYMPOSIUM ON 3D USER INTERFACES (3DUI), 2016 : 149 - 158
  • [5] Generic modeling of 3D objects from single 2D images
    Bilodeau, GA
    Bergevin, R
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000 : 770 - 773
  • [6] Estimating 3D Objects from 2D Images using 3D Transformation Network
    Ul Islam, Naeem
    Park, Jaebyung
    2021 18TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2021 : 471 - 475
  • [7] 3D Human Pose Estimation via Deep Learning from 2D annotations
    Brau, Ernesto
    Jiang, Hao
    PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016 : 582 - 591
  • [8] Robotic Folding of 2D and 3D Structures from a Ribbon
    Wang, Liyu
    Plecnik, Mark M.
    Fearing, Ronald S.
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016 : 3655 - 3660
  • [9] ON AUTOMATIC RECOGNITION OF 3D STRUCTURES FROM 2D REPRESENTATIONS
    ALDEFELD, B
    COMPUTER-AIDED DESIGN, 1983, 15 (02) : 59 - 64
  • [10] Estimating 3D Camera Pose from 2D Pedestrian Trajectories
    Xu, Yan
    Roy, Vivek
    Kitani, Kris
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020 : 2568 - 2577