3D Priors for Scene Learning from a Single View

被引:0
作者
Rother, Diego [1 ]
Patwardban, Kedar [2 ]
Aganj, Iman [1 ]
Sapiro, Guillermo [1 ]
机构
[1] Univ Minnesota, Minneapolis, MN 55455 USA
[2] GE Co, Hyderabad, Andhra Pradesh, India
来源
2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3 | 2008年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A framework for scene learning from a single still video camera is presented in this work. In particular, the camera transformation and the direction of the shadows are learned using information extracted from pedestrians walking in the scene. The proposed approach poses the scene learning estimation as a likelihood maximization problem, efficiently solved via factorization and dynamic programming, and amenable to an online implementation. We introduce a 3D prior to model the pedestrian's appearance from any viewpoint, and learn it using a standard off-the-shelf consumer video camera and the Radon transform. This 3D prior or "appearance model" is used to quantify the agreement between the tentative parameters and the actual video observations, taking into account not only the pixels occupied by the pedestrian, but also those occupied by the his shadows and/or reflections. The presentation of the framework is complemented with an example of a casual video scene showing the importance of the learned 3D pedestrian prior and the accuracy of the proposed approach.
引用
收藏
页码:635 / +
页数:2
相关论文
共 50 条
  • [41] Geometry-Free View Synthesis: Transformers and no 3D Priors
    Rombach, Robin
    Esser, Patrick
    Ommer, Bjoern
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14336 - 14346
  • [42] 3D-Scene-Former: 3D scene generation from a single RGB image using Transformers
    Chatterjee, Jit
    Vega, Maria Torres
    [J]. VISUAL COMPUTER, 2025, 41 (04) : 2875 - 2889
  • [43] Inferring 3D scene structure from a single polarization image
    Rahmann, S
    [J]. POLARIZATION AND COLOR TECHNIQUES IN INDUSTRIAL INSPECTION, 1999, 3826 : 22 - 33
  • [44] Panoptic 3D Scene Reconstruction From a Single RGB Image
    Dahnert, Manuel
    Hou, Ji
    Niessner, Matthias
    Dai, Angela
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [45] Automatic 3D Indoor Scene Modeling from Single Panorama
    Yang, Yang
    Jin, Shi
    Liu, Ruiyang
    Kang, Sing Bing
    Yu, Jingyi
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3926 - 3934
  • [46] Unsupervised Learning of 3D Scene Flow from Monocular Camera
    Wang, Guangming
    Tian, Xiaoyu
    Ding, Ruiqi
    Wang, Hesheng
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4325 - 4331
  • [47] Data Augmented 3D Semantic Scene Completion with 2D Segmentation Priors
    Dourado, Aloisio
    Guth, Frederico
    de Campos, Teofilo
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 687 - 696
  • [48] Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture
    Chen, Yixin
    Ni, Junfeng
    Jiang, Nan
    Zhang, Yaowei
    Zhu, Yixin
    Huang, Siyuan
    [J]. 2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1456 - 1467
  • [49] LIST: Learning Implicitly from Spatial Transformers for Single-View 3D Reconstruction
    Arshad, Mohammad Samiul
    Beksi, William J.
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9287 - 9296
  • [50] 3D building reconstruction from single street view images using deep learning
    Pang, Hui En
    Biljecki, Filip
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 112