Hierarchical VAEs provide a normative account of motion processing in the primate brain

被引:0
作者
Vafaii, Hadi [1 ]
Yates, Jacob L. [2 ]
Butts, Daniel A. [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Univ Calif Berkeley, Berkeley, CA USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
NEURAL-NETWORKS; BAYESIAN-INFERENCE; SELF-MOTION; PERCEPTION; MODELS; FIELD; UNCERTAINTY; ILLUSIONS; FRAMEWORK; RESPONSES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The relationship between perception and inference, as postulated by Helmholtz in the 19th century, is paralleled in modern machine learning by generative models like Variational Autoencoders (VAEs) and their hierarchical variants. Here, we evaluate the role of hierarchical inference and its alignment with brain function in the domain of motion perception. We first introduce a novel synthetic data framework, Retinal Optic Flow Learning (ROFL), which enables control over motion statistics and their causes. We then present a new hierarchical VAE and test it against alternative models on two downstream tasks: (i) predicting ground truth causes of retinal optic flow (e.g., self-motion); and (ii) predicting the responses of neurons in the motion processing pathway of primates. We manipulate the model architectures (hierarchical versus non-hierarchical), loss functions, and the causal structure of the motion stimuli. We find that hierarchical latent structure in the model leads to several improvements. First, it improves the linear decodability of ground truth factors and does so in a sparse and disentangled manner. Second, our hierarchical VAE outperforms previous state-of-the-art models in predicting neuronal responses and exhibits sparse latent-to-neuron relationships. These results depend on the causal structure of the world, indicating that alignment between brains and artificial neural networks depends not only on architecture but also on matching ecologically relevant stimulus statistics. Taken together, our results suggest that hierarchical Bayesian inference underlines the brain's understanding of the world, and hierarchical VAEs can effectively model this understanding.
引用
收藏
页数:39
相关论文
共 151 条
  • [1] SPATIOTEMPORAL ENERGY MODELS FOR THE PERCEPTION OF MOTION
    ADELSON, EH
    BERGEN, JR
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (02) : 284 - 299
  • [2] al-Haytham Ibn, BOOK OPTICS, P1011
  • [3] [Anonymous], 1867, Handbuch der physiologischen Optik
  • [4] On invariance and selectivity in representation learning
    Anselmi, Fabio
    Rosasco, Lorenzo
    Poggio, Tomaso
    [J]. INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2016, 5 (02) : 134 - 158
  • [5] Analyzing biological and artificial neural networks: challenges with opportunities for synergy?
    Barrett, David G. T.
    Morcos, Ari S.
    Macke, Jakob H.
    [J]. CURRENT OPINION IN NEUROBIOLOGY, 2019, 55 : 55 - 64
  • [6] Canonical Microcircuits for Predictive Coding
    Bastos, Andre M.
    Usrey, W. Martin
    Adams, Rick A.
    Mangun, George R.
    Fries, Pascal
    Friston, Karl J.
    [J]. NEURON, 2012, 76 (04) : 695 - 711
  • [7] Representation Learning: A Review and New Perspectives
    Bengio, Yoshua
    Courville, Aaron
    Vincent, Pascal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1798 - 1828
  • [8] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [9] 3D Visual Response Properties of MSTd Emerge from an Efficient, Sparse Population Code
    Beyeler, Michael
    Dutt, Nikil
    Krichmar, Jeffrey L.
    [J]. JOURNAL OF NEUROSCIENCE, 2016, 36 (32) : 8399 - 8415
  • [10] Bowman S., 2016, P 20 SIGNLL C COMPUT, P10, DOI DOI 10.18653/V1/K16-1002