SparsePose: Sparse-View Camera Pose Regression and Refinement

被引:12
|
作者
Sinha, Samarth [1 ]
Zhang, Jason Y. [2 ]
Tagliasacchi, Andrea [1 ,3 ,4 ]
Gilitschenski, Igor [1 ]
Lindell, David B. [1 ,5 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
[3] Simon Fraser Univ, Burnaby, BC, Canada
[4] Google, Toronto, ON, Canada
[5] Vector Inst, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/CVPR52729.2023.02045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Camera pose estimation is a key step in standard 3D reconstruction pipelines that operate on a dense set of images of a single object or scene. However, methods for pose estimation often fail when only a few images are available because they rely on the ability to robustly identify and match visual features between image pairs. While these methods can work robustly with dense camera views, capturing a large set of images can be time-consuming or impractical. We propose SparsePose for recovering accurate camera poses given a sparse set of wide-baseline images (fewer than 10). The method learns to regress initial camera poses and then iteratively refine them after training on a large-scale dataset of objects (Co3D: Common Objects in 3D). SparsePose significantly outperforms conventional and learning-based baselines in recovering accurate camera rotations and translations. We also demonstrate our pipeline for high-fidelity 3D reconstruction using only 5-9 images of an object. Project webpage: https://sparsepose.github.io/
引用
收藏
页码:21349 / 21359
页数:11
相关论文
共 50 条
  • [21] Noise filtering method in images in sparse-view covers
    Goshin, Y. V.
    Arkhipova, D. V.
    COMPUTER OPTICS, 2024, 48 (03) : 432 - 438
  • [22] DEEP BACK PROJECTION FOR SPARSE-VIEW CT RECONSTRUCTION
    Ye, Dong Hye
    Buzzard, Gregery T.
    Ruby, Max
    Bouman, Charles A.
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 1 - 5
  • [23] Beam Hardening Correction for Sparse-View CT Reconstruction
    Liu, Wenlei
    Rong, Junyan
    Gao, Peng
    Liao, Qimei
    Lu, HongBing
    MEDICAL IMAGING 2015: IMAGE PROCESSING, 2015, 9413
  • [24] Sparse-view tomography via displacement function interpolation
    Gengsheng L. Zeng
    Visual Computing for Industry, Biomedicine, and Art, 2
  • [25] Learning Projection Views for Sparse-View CT Reconstruction
    Yang, Liutao
    Ge, Rongjun
    Feng, Shichang
    Zhang, Daoqiang
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2645 - 2653
  • [26] Camera Pose Auto-encoders for Improving Pose Regression
    Shavit, Yoli
    Keller, Yosi
    COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 140 - 157
  • [27] Combining convolutional sparse coding with total variation for sparse-view CT reconstruction
    Li, Xuru
    Li, Yu
    Chen, Ping
    Li, Fuzhong
    APPLIED OPTICS, 2022, 61 (06) : C116 - C124
  • [28] Sparse-View CT Reconstruction via Generative Adversarial Networks
    Zhao, Zhongwei
    Sun, Yuewen
    Cong, Peng
    2018 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE PROCEEDINGS (NSS/MIC), 2018,
  • [29] Relightable and Animatable Neural Avatar from Sparse-View Video
    Xu, Zhen
    Peng, Sida
    Geng, Chen
    Mou, Linzhan
    Yan, Zihan
    Sun, Jiaming
    Bao, Hujun
    Zhou, Xiaowei
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 990 - 1000
  • [30] Pattern-Based Cloth Registration and Sparse-View Animation
    Halimi, Oshri
    Stuyck, Tuur
    Xiang, Donglai
    Bagautdinov, Timur
    Wen, He
    Kimmel, Ron
    Shiratori, Takaaki
    Wu, Chenglei
    Sheikh, Yaser
    Prada, Fabian
    ACM TRANSACTIONS ON GRAPHICS, 2022, 41 (06):