Hierarchical VAEs provide a normative account of motion processing in the primate brain

被引：0

作者：

Vafaii, Hadi ^{[1
]}

Yates, Jacob L. ^{[2
]}

Butts, Daniel A. ^{[1
]}

机构：

[1] Univ Maryland, College Pk, MD 20742 USA

[2] Univ Calif Berkeley, Berkeley, CA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

NEURAL-NETWORKS; BAYESIAN-INFERENCE; SELF-MOTION; PERCEPTION; MODELS; FIELD; UNCERTAINTY; ILLUSIONS; FRAMEWORK; RESPONSES;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The relationship between perception and inference, as postulated by Helmholtz in the 19th century, is paralleled in modern machine learning by generative models like Variational Autoencoders (VAEs) and their hierarchical variants. Here, we evaluate the role of hierarchical inference and its alignment with brain function in the domain of motion perception. We first introduce a novel synthetic data framework, Retinal Optic Flow Learning (ROFL), which enables control over motion statistics and their causes. We then present a new hierarchical VAE and test it against alternative models on two downstream tasks: (i) predicting ground truth causes of retinal optic flow (e.g., self-motion); and (ii) predicting the responses of neurons in the motion processing pathway of primates. We manipulate the model architectures (hierarchical versus non-hierarchical), loss functions, and the causal structure of the motion stimuli. We find that hierarchical latent structure in the model leads to several improvements. First, it improves the linear decodability of ground truth factors and does so in a sparse and disentangled manner. Second, our hierarchical VAE outperforms previous state-of-the-art models in predicting neuronal responses and exhibits sparse latent-to-neuron relationships. These results depend on the causal structure of the world, indicating that alignment between brains and artificial neural networks depends not only on architecture but also on matching ecologically relevant stimulus statistics. Taken together, our results suggest that hierarchical Bayesian inference underlines the brain's understanding of the world, and hierarchical VAEs can effectively model this understanding.

引用

页数：39

共 151 条

[1] SPATIOTEMPORAL ENERGY MODELS FOR THE PERCEPTION OF MOTION
ADELSON, EH
BERGEN, JR
[J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (02) : 284 - 299
[2] al-Haytham Ibn, BOOK OPTICS, P1011
[3] [Anonymous], 1867, Handbuch der physiologischen Optik
[4] On invariance and selectivity in representation learning
Anselmi, Fabio
Rosasco, Lorenzo
Poggio, Tomaso
[J]. INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2016, 5 (02) : 134 - 158
[5] Analyzing biological and artificial neural networks: challenges with opportunities for synergy?
Barrett, David G. T.
Morcos, Ari S.
Macke, Jakob H.
[J]. CURRENT OPINION IN NEUROBIOLOGY, 2019, 55 : 55 - 64
[6] Canonical Microcircuits for Predictive Coding
Bastos, Andre M.
Usrey, W. Martin
Adams, Rick A.
Mangun, George R.
Fries, Pascal
Friston, Karl J.
[J]. NEURON, 2012, 76 (04) : 695 - 711
[7] Representation Learning: A Review and New Perspectives
Bengio, Yoshua
Courville, Aaron
Vincent, Pascal
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1798 - 1828
[8] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
BENJAMINI, Y
HOCHBERG, Y
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
[9] 3D Visual Response Properties of MSTd Emerge from an Efficient, Sparse Population Code
Beyeler, Michael
Dutt, Nikil
Krichmar, Jeffrey L.
[J]. JOURNAL OF NEUROSCIENCE, 2016, 36 (32) : 8399 - 8415
[10] Bowman S., 2016, P 20 SIGNLL C COMPUT, P10, DOI DOI 10.18653/V1/K16-1002

← 1 2 3 4 5 6 7 8 9 10 →