Estimating Generic 3D Room Structures from 2D Annotations

被引：0

作者：

Rozumnyi, Denys ^{[2
,4
]}

Popov, Stefan ^{[1
]}

Maninis, Kevis-Kokitsi ^{[1
]}

Niessner, Matthias ^{[3
]}

Ferrari, Vittorio ^{[1
]}

机构：

[1] Google Res, Zurich, Switzerland

[2] Swiss Fed Inst Technol, Zurich, Switzerland

[3] Tech Univ Munich, Munich, Germany

[4] Google, Zurich, Switzerland

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Indoor rooms are among the most common use cases in 3D scene understanding. Current state-of-the-art methods for this task are driven by large annotated datasets. Room layouts are especially important, consisting of structural elements in 3D, such as wall, floor, and ceiling. However, they are difficult to annotate, especially on pure RGB video. We propose a novel method to produce generic 3D room layouts just from 2D segmentation masks, which are easy to annotate for humans. Based on these 2D annotations, we automatically reconstruct 3D plane equations for the structural elements and their spatial extent in the scene, and connect adjacent elements at the appropriate contact edges. We annotate and publicly release 2246 3D room layouts on the RealEstate10k dataset, containing YouTube videos. We demonstrate the high quality of these 3D layouts annotations with extensive experiments.

引用

页数：13

共 50 条

[1] Estimating Generic 3D Room Structures from 2D Annotations
Rozumnyi, Denys
Popov, Stefan
Maninis, Kevis-Kokitsi
Nießner, Matthias
Ferrari, Vittorio
Advances in Neural Information Processing Systems, 2023, 36
[2] Learning to Segment 3D Linear Structures Using Only 2D Annotations
Kozinski, Mateusz
Mosinska, Agata
Salzmann, Mathieu
Fua, Pascal
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2018, PT II, 2018, 11071 : 283 - 291
[3] Interactive 3D Character Modeling from 2D Orthogonal Drawings with Annotations
Huang, Zhengyu
Xie, Haoran
Fukusato, Tsukasa
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
[4] Interpreting 2D Gesture Annotations in 3D Augmented Reality
Nuernberger, Benjamin
Lien, Kuo-Chin
Hollerer, Tobias
Turk, Matthew
2016 IEEE SYMPOSIUM ON 3D USER INTERFACES (3DUI), 2016, : 149 - 158
[5] Generic modeling of 3D objects from single 2D images
Bilodeau, GA
Bergevin, R
15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 770 - 773
[6] Estimating 3D Objects from 2D Images using 3D Transformation Network
Ul Islam, Naeem
Park, Jaebyung
2021 18TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2021, : 471 - 475
[7] 3D Human Pose Estimation via Deep Learning from 2D annotations
Brau, Ernesto
Jiang, Hao
PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 582 - 591
[8] Robotic Folding of 2D and 3D Structures from a Ribbon
Wang, Liyu
Plecnik, Mark M.
Fearing, Ronald S.
2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 3655 - 3660
[9] ON AUTOMATIC RECOGNITION OF 3D STRUCTURES FROM 2D REPRESENTATIONS
ALDEFELD, B
COMPUTER-AIDED DESIGN, 1983, 15 (02) : 59 - 64
[10] Estimating 3D Camera Pose from 2D Pedestrian Trajectories
Xu, Yan
Roy, Vivek
Kitani, Kris
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2568 - 2577

← 1 2 3 4 5 →