Multi-view Consistency as Supervisory Signal for Learning Shape and Pose Prediction

被引:96
|
作者
Tulsiani, Shubham [1 ]
Efros, Alexei A. [1 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2018.00306
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a framework for learning single-view shape and pose prediction without using direct supervision for either. Our approach allows leveraging multi-view observations from unknown poses as supervisory signal during training. Our proposed training setup enforces geometric consistency between the independently predicted shape and pose from two views of the same instance. We consequently learn to predict shape in an emergent canonical (view-agnostic) frame along with a corresponding pose predictor. We show empirical and qualitative results using the ShapeNet dataset and observe encouragingly competitive performance to previous techniques which rely on stronger forms of supervision. We also demonstrate the applicability of our framework in a realistic setting which is beyond the scope of existing techniques: using a training dataset comprised of online product images where the underlying shape and pose are unknown.
引用
收藏
页码:2897 / 2905
页数:9
相关论文
共 50 条
  • [31] circRNA-binding protein site prediction based on multi-view deep learning, subspace learning and multi-view classifier
    Li, Hui
    Deng, Zhaohong
    Yang, Haitao
    Pan, Xiaoyong
    Wei, Zhisheng
    Shen, Hong-Bin
    Choi, Kup-Sze
    Wang, Lei
    Wang, Shitong
    Wu, Jing
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [32] MULTI-VIEW METRIC LEARNING FOR MULTI-VIEW VIDEO SUMMARIZATION
    Wang, Linbo
    Fang, Xianyong
    Guo, Yanwen
    Fu, Yanwei
    2016 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2016, : 179 - 182
  • [33] Multi-View Consistency for Relation Extraction via Mutual Information and Structure Prediction
    Ben Veyseh, Amir Pouran
    Dernoncourt, Franck
    Thai, My
    Dou, Dejing
    Thien Huu Nguyen
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9106 - 9113
  • [34] Kernel machine based learning for multi-view face detection and pose estimation
    Li, SZ
    Fu, QD
    Gu, L
    Schölkopf, B
    Cheng, YM
    Zhang, HJ
    EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, 2001, : 674 - 679
  • [35] 3D Human Pose and Shape Estimation Through Collaborative Learning and Multi-view Model-fitting
    Li, Zhongguo
    Oskarsson, Magnus
    Heyden, Anders
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1887 - 1896
  • [36] Multi-View Consistency for Infinitary Regular Languages
    Pittou, Maria
    Tripakis, Stavros
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING AND SIMULATION (SAMOS), 2016, : 148 - 155
  • [37] Pyramid Multi-View Stereo with Local Consistency
    Liao, Jie
    Fu, Yanping
    Yan, Qingan
    Xiao, Chunxia
    COMPUTER GRAPHICS FORUM, 2019, 38 (07) : 335 - 346
  • [38] Sparse Multi-View Consistency for Object Segmentation
    Djelouah, Abdelaziz
    Franco, Jean-Sebastien
    Boyer, Edmond
    Le Clerc, Francois
    Perez, Patrick
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) : 1890 - 1903
  • [39] Multi-view Gaussian processes with posterior consistency
    Sun, Shiliang
    Sun, Xuli
    Liu, Qiuyang
    INFORMATION SCIENCES, 2021, 547 : 710 - 722
  • [40] Consistency in UML and B multi-view specifications
    Ossami, DDO
    Jacquot, JP
    Souquières, J
    INTEGRATED FORMAL METHODS, PROCEEDINGS, 2005, 3771 : 386 - 405