Brain-inspired multimodal learning based on neural networks

Cited by: 1
Authors
Chang Liu
Fuchun Sun
Bo Zhang
Affiliation
[1] Department of Computer Science and Technology, Tsinghua University
Keywords
multimodal learning; brain-inspired learning; deep learning; neural networks
DOI
Not available
Chinese Library Classification (CLC) number
R338 [Neurophysiology]
Subject classification code
0710; 071006
Abstract
Modern computational models have leveraged biological advances in human brain research. This study addresses the problem of multimodal learning with the help of brain-inspired models. Specifically, a unified multimodal learning architecture is proposed based on deep neural networks, which are inspired by the biology of the visual cortex of the human brain. This unified framework is validated by two practical multimodal learning tasks: image captioning, involving visual and natural language signals, and visual-haptic fusion, involving haptic and visual signals. Extensive experiments are conducted under the framework, and competitive results are achieved.
Pages: 61-72
Page count: 12