Learning Multisensory Integration and Coordinate Transformation via Density Estimation

Cited by: 38
Authors
Makin, Joseph G. [1]
Fellows, Matthew R.
Sabes, Philip N.
Affiliations
[1] Univ Calif San Francisco, Dept Physiol, San Francisco, CA USA
Keywords
POSTERIOR PARIETAL CORTEX; CORTICAL CONNECTIONS; SENSORY INTEGRATION; REFERENCE FRAMES; NEURAL NETWORK; SENSORIMOTOR; MACAQUE; AREA; INFORMATION; ADAPTATION
DOI
10.1371/journal.pcbi.1003035
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Sensory processing in the brain includes three key operations: multisensory integration, the task of combining cues into a single estimate of a common underlying stimulus; coordinate transformations, the change of reference frame for a stimulus (e.g., retinotopic to body-centered) effected through knowledge about an intervening variable (e.g., gaze position); and the incorporation of prior information. Statistically optimal sensory processing requires that each of these operations maintains the correct posterior distribution over the stimulus. Elements of this optimality have been demonstrated in many behavioral contexts in humans and other animals, suggesting that the neural computations are indeed optimal. That the relationships between sensory modalities are complex and plastic further suggests that these computations are learned, but how? We provide a principled answer, by treating the acquisition of these mappings as a case of density estimation, a well-studied problem in machine learning and statistics, in which the distribution of observed data is modeled in terms of a set of fixed parameters and a set of latent variables. In our case, the observed data are unisensory-population activities, the fixed parameters are synaptic connections, and the latent variables are multisensory-population activities. In particular, we train a restricted Boltzmann machine with the biologically plausible contrastive-divergence rule to learn a range of neural computations not previously demonstrated under a single approach: optimal integration; encoding of priors; hierarchical integration of cues; learning when not to integrate; and coordinate transformation. The model makes testable predictions about the nature of multisensory representations.
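For readers unfamiliar with what "statistically optimal" integration means here, the following is a minimal sketch of the textbook case, not the authors' model. It assumes each cue is an independent Gaussian estimate of the same stimulus, so the posterior is again Gaussian, with a precision-weighted mean. The function name `integrate_gaussian_cues` and the example numbers are hypothetical. A Gaussian prior can be folded in as one more (mean, variance) pair.

```python
from typing import Iterable, Tuple


def integrate_gaussian_cues(cues: Iterable[Tuple[float, float]]) -> Tuple[float, float]:
    """Combine independent Gaussian cues about one stimulus.

    Each cue is a (mean, variance) pair. Assumption: cues are conditionally
    independent given the stimulus, and the prior is flat unless it is passed
    in as an extra (mean, variance) pair.
    Returns the posterior (mean, variance).
    """
    total_precision = 0.0
    weighted_sum = 0.0
    for mean, variance in cues:
        if variance <= 0:
            raise ValueError("variance must be positive")
        precision = 1.0 / variance          # reliability of this cue
        total_precision += precision
        weighted_sum += precision * mean    # more reliable cues count more
    if total_precision == 0.0:
        raise ValueError("at least one cue is required")
    return weighted_sum / total_precision, 1.0 / total_precision


# Hypothetical example: a sharp visual cue and a noisier proprioceptive cue.
visual = (0.0, 1.0)          # mean 0.0, variance 1.0
proprioceptive = (2.0, 4.0)  # mean 2.0, variance 4.0
posterior_mean, posterior_var = integrate_gaussian_cues([visual, proprioceptive])
print(posterior_mean, posterior_var)
```

The combined estimate lies closer to the more reliable cue, and its variance is smaller than that of either cue alone. This is the behavioral signature that a learned network would need to reproduce to count as optimal in this sense.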
Pages: 17