Multi-view Consistency as Supervisory Signal for Learning Shape and Pose Prediction

被引:96
|
作者
Tulsiani, Shubham [1 ]
Efros, Alexei A. [1 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2018.00306
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a framework for learning single-view shape and pose prediction without using direct supervision for either. Our approach allows leveraging multi-view observations from unknown poses as supervisory signal during training. Our proposed training setup enforces geometric consistency between the independently predicted shape and pose from two views of the same instance. We consequently learn to predict shape in an emergent canonical (view-agnostic) frame along with a corresponding pose predictor. We show empirical and qualitative results using the ShapeNet dataset and observe encouragingly competitive performance to previous techniques which rely on stronger forms of supervision. We also demonstrate the applicability of our framework in a realistic setting which is beyond the scope of existing techniques: using a training dataset comprised of online product images where the underlying shape and pose are unknown.
引用
收藏
页码:2897 / 2905
页数:9
相关论文
共 50 条
  • [41] Adaptive Learning Based View Synthesis Prediction for Multi-View Video Coding
    Jinhui Hu
    Ruimin Hu
    Zhongyuan Wang
    Ge Gao
    Mang Duan
    Yan Gong
    Journal of Signal Processing Systems, 2014, 74 : 115 - 126
  • [42] Dual Contrastive Prediction for Incomplete Multi-View Representation Learning
    Lin, Yijie
    Gou, Yuanbiao
    Liu, Xiaotian
    Bai, Jinfeng
    Lv, Jiancheng
    Peng, Xi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4447 - 4461
  • [43] Consistency Meets Inconsistency: A Unified Graph Learning Framework for Multi-view Clustering
    Liang, Youwei
    Huang, Dong
    Wang, Chang-Dong
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 1204 - 1209
  • [44] Robust multi-view discriminant analysis with view-consistency
    Yang, Xiang-Fei
    Li, Chun-Na
    Shao, Yuan-Hai
    INFORMATION SCIENCES, 2022, 596 : 153 - 168
  • [45] Adaptive Learning Based View Synthesis Prediction for Multi-View Video Coding
    Hu, Jinhui
    Hu, Ruimin
    Wang, Zhongyuan
    Gao, Ge
    Duan, Mang
    Gong, Yan
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 74 (01): : 115 - 126
  • [46] Multi-View Image Classification With Visual, Semantic and View Consistency
    Zhang, Chunjie
    Cheng, Jian
    Tian, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 617 - 627
  • [47] Multi-view dreaming: multi-view world model with contrastive learning
    Kinose A.
    Okumura R.
    Okada M.
    Taniguchi T.
    Advanced Robotics, 2023, 37 (19) : 1212 - 1220
  • [48] Contrastive Multi-View Learning for 3D Shape Clustering
    Peng, Bo
    Lin, Guoting
    Lei, Jianjun
    Qin, Tianyi
    Cao, Xiaochun
    Ling, Nam
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6262 - 6272
  • [49] Reciprocal consistency graph learning by aligning multi-semantic spaces for multi-view clustering
    Jiang, Guangqi
    Wang, Huibing
    Yan, Xiaohong
    Yan, Huizhu
    Peng, Jinjia
    Fu, Xianping
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [50] Multi-view human pose and shape estimation via mesh-aligned voxel interpolation
    Zhang, Yixuan
    Zhang, Jiguang
    Xu, Shibiao
    Xiao, Jun
    INFORMATION FUSION, 2025, 114