Learning Internal Representations of 3D Transformations From 2D Projected Inputs

被引:0
作者
Connor, Marissa [1 ]
Olshausen, Bruno [2 ,3 ]
Rozell, Christopher [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Sch Optometry, Berkeley, CA 94720 USA
关键词
MENTAL ROTATION; KINETIC DEPTH; 3-DIMENSIONAL STRUCTURE; LIE-GROUPS; MOTION; RECONSTRUCTION; MODEL; SHAPE;
D O I
10.1162/neco_a_01695
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a computational model for inferring 3D structure from the motion of projected 2D points in an image, with the aim of understanding how biological vision systems learn and internally represent 3D transformations from the statistics of their input. The model uses manifold transport operators to describe the action of 3D points in a scene as they undergo transformation. We show that the model can learn the generator of the Lie group for these transformations from purely 2D input, providing a proof-of-concept demonstration for how biological systems could adapt their internal representations based on sensory input. Focusing on a rotational model, we evaluate the ability of the model to infer depth from moving 2D projected points and to learn rotational transformations from 2D training stimuli. Finally, we compare the model performance to psychophysical performance on structure-from-motion tasks.
引用
收藏
页码:2505 / 2539
页数:35
相关论文
共 50 条
  • [1] Learning 3D Deformation of Animals from 2D Images
    Kanazawa, Angjoo
    Kovalsky, Shahar
    Basri, Ronen
    Jacobs, David
    COMPUTER GRAPHICS FORUM, 2016, 35 (02) : 365 - 374
  • [2] 2D/3D image registration using regression learning
    Chou, Chen-Rui
    Frederick, Brandon
    Mageras, Gig
    Chang, Sha
    Pizer, Stephen
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (09) : 1095 - 1106
  • [3] Learning 3D faces from 2D images via Stacked Contractive Autoencoder
    Zhang, Jian
    Li, Ke
    Liang, Yun
    Li, Na
    NEUROCOMPUTING, 2017, 257 : 67 - 78
  • [4] Exploring rich intermediate representations for reconstructing 3D shapes from 2D images
    Yang, Yang
    Han, Junwei
    Zhang, Dingwen
    Tian, Qi
    PATTERN RECOGNITION, 2022, 122
  • [5] Local Metric Learning in 2D/3D Deformable Registration With Application in the Abdomen
    Zhao, Qingyu
    Chou, Chen-Rui
    Mageras, Gig
    Pizer, Stephen
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2014, 33 (08) : 1592 - 1600
  • [6] Reconstruction of 3D Microstructures from 2D Images via Transfer Learning
    Bostanabad, Ramin
    COMPUTER-AIDED DESIGN, 2020, 128
  • [7] Learning 3D Action Models from a few 2D videos for View Invariant Action Recognition
    Natarajan, Pradeep
    Singh, Vivek Kumar
    Nevatia, Ram
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2006 - 2013
  • [8] Computing 3D saliency from a 2D image
    Ramenahalli, Sudarshan
    Niebur, Ernst
    2013 47TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2013,
  • [9] Generation and quality improvement of 3D models from silhouettes of 2D images
    Phothong, Watchama
    Wu, Tsung-Chien
    Yu, Chun-Yeh
    Lai, Jiing-Yih
    Wang, Douglas W.
    Liao, Chao-Yaug
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2018, 41 (03) : 216 - 228
  • [10] Inferring statistical properties of 3D cell geometry from 2D slices
    Sharp, Tristan A.
    Merkel, Matthias
    Manning, M. Lisa
    Liu, Andrea J.
    PLOS ONE, 2019, 14 (02):