Network model of predictive coding based on reservoir computing for multi-modal processing of visual and auditory signals

被引:9
作者
Yonemura, Yoshihiro [1 ]
Katori, Yuichi [1 ,2 ]
机构
[1] Future Univ Hakodate, Sch Syst Informat Sci, 116-2 Kamedanakano Cho, Hakodate, Hokkaido 0418655, Japan
[2] Univ Tokyo, Inst Ind Sci, Meguro Ku, 4-6-1 Komaba, Tokyo 1538505, Japan
来源
IEICE NONLINEAR THEORY AND ITS APPLICATIONS | 2021年 / 12卷 / 02期
关键词
reservoir computing; predictive coding; multi-modal integration;
D O I
10.1587/nolta.12.143
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We propose a hierarchical network model based on predictive coding and reservoir computing as a model of multi-modal sensory integration in the brain. The network is composed of visual, auditory, and integration areas. In each area, the dynamical reservoir acts as a generative model that reproduces the time-varying sensory signal. The states of the visual and auditory reservoir are spatially compressed and are sent to the integration area. We evaluate the model with a dataset of time courses, including a pair of visual (hand-written characters) and auditory (read utterances) signal. We show that the model learns the association of multiple modalities of the sensory signals and that the model reconstructs the visual signal from a given corresponding auditory signal. Our approach presents a novel dynamical mechanism of the multi-modal information processing in the brain and the fundamental technology for a brain like an artificial intelligence system.
引用
收藏
页码:143 / 156
页数:14
相关论文
共 19 条
[1]   The hierarchically mechanistic mind: an evolutionary systems theory of the human brain, cognition, and behavior [J].
Badcock, Paul B. ;
Friston, Karl J. ;
Ramstead, Maxwell J. D. ;
Ploeger, Annemie ;
Hohwy, Jakob .
COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE, 2019, 19 (06) :1319-1351
[2]   Hierarchical Models in the Brain [J].
Friston, Karl J. .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (11)
[3]   Auditory-visual integration during multimodal object recognition in humans: A behavioral and electrophysiological study [J].
Giard, MH ;
Peronnet, F .
JOURNAL OF COGNITIVE NEUROSCIENCE, 1999, 11 (05) :473-490
[5]  
Jaeger H., 2002, TUTORIAL TRAINING RE, V5
[6]   Network Model for Dynamics of Perception with Reservoir Computing and Predictive Coding [J].
Katori, Yuichi .
ADVANCES IN COGNITIVE NEURODYNAMICS (VI), 2018, :89-95
[7]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[8]  
Lyon R. F., 1982, Proceedings of ICASSP 82. IEEE International Conference on Acoustics, Speech and Signal Processing, P1282
[9]   Real-time computing without stable states:: A new framework for neural computation based on perturbations [J].
Maass, W ;
Natschläger, T ;
Markram, H .
NEURAL COMPUTATION, 2002, 14 (11) :2531-2560
[10]   Physical reservoir computing-an introductory perspective [J].
Nakajima, Kohei .
JAPANESE JOURNAL OF APPLIED PHYSICS, 2020, 59 (06)