Learning Internal Representations of 3D Transformations From 2D Projected Inputs

Cited by: 0

Authors:

Connor, Marissa [1]

Olshausen, Bruno [2, 3]

Rozell, Christopher [1]

Affiliations:

[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

[2] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA

[3] Univ Calif Berkeley, Sch Optometry, Berkeley, CA 94720 USA

Source:

NEURAL COMPUTATION | 2024 / Volume 36 / Issue 11

Keywords:

MENTAL ROTATION; KINETIC DEPTH; 3-DIMENSIONAL STRUCTURE; LIE-GROUPS; MOTION; RECONSTRUCTION; MODEL; SHAPE;

DOI:

10.1162/neco_a_01695

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We describe a computational model for inferring 3D structure from the motion of projected 2D points in an image, with the aim of understanding how biological vision systems learn and internally represent 3D transformations from the statistics of their input. The model uses manifold transport operators to describe the action of 3D points in a scene as they undergo transformation. We show that the model can learn the generator of the Lie group for these transformations from purely 2D input, providing a proof-of-concept demonstration for how biological systems could adapt their internal representations based on sensory input. Focusing on a rotational model, we evaluate the ability of the model to infer depth from moving 2D projected points and to learn rotational transformations from 2D training stimuli. Finally, we compare the model performance to psychophysical performance on structure-from-motion tasks.
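As a rough illustration of the setting the abstract describes (not the authors' implementation), the sketch below applies a rotation built from a Lie group generator to 3D points and then projects them to 2D. The generator `A_Y`, the helper names, and the choice of orthographic projection are all assumptions made for this example; the abstract does not specify the projection model, and the learning of the generator from 2D input is not shown here.

```python
import math

# Assumed for illustration: generator of rotations about the vertical (y) axis.
# It is skew-symmetric, so exp(theta * A_Y) is a rotation matrix.
A_Y = [[0.0, 0.0, 1.0],
       [0.0, 0.0, 0.0],
       [-1.0, 0.0, 0.0]]


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def rotation_from_generator(gen, theta):
    """Closed form of exp(theta * gen) for a unit-axis skew-symmetric
    generator (Rodrigues' formula): I + sin(theta) G + (1 - cos(theta)) G^2."""
    gen2 = matmul(gen, gen)
    s, c = math.sin(theta), 1.0 - math.cos(theta)
    return [[(1.0 if i == j else 0.0) + s * gen[i][j] + c * gen2[i][j]
             for j in range(3)] for i in range(3)]


def transform(point, gen, theta):
    """Move a 3D point along the path generated by `gen` by angle theta."""
    r = rotation_from_generator(gen, theta)
    return tuple(sum(r[i][k] * point[k] for k in range(3)) for i in range(3))


def project(point):
    """Orthographic projection (an assumption): drop the depth coordinate z."""
    return (point[0], point[1])


if __name__ == "__main__":
    # Two points that project to the same 2D location but differ in depth.
    near, far = (0.0, 0.0, 1.0), (0.0, 0.0, 2.0)
    for theta in (0.0, 0.1, 0.2):
        print(theta,
              project(transform(near, A_Y, theta)),
              project(transform(far, A_Y, theta)))
```

In this toy setup the two points start at the same 2D location, yet the deeper point moves twice as far in the image under the same rotation. That dependence of 2D motion on depth is the kind of cue a structure-from-motion model can exploit.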

Pages: 2505-2539

Page count: 35

