Modeling language and cognition with deep unsupervised learning: a tutorial overview

Cited by: 46
Authors
Zorzi, Marco [1 ,2 ]
Testolin, Alberto [1 ]
Stoianov, Ivilin P. [1 ,3 ]
Affiliations
[1] Univ Padua, Dept Gen Psychol, Computat Cognit Neurosci Lab, I-35131 Padua, Italy
[2] IRCCS San Camillo Neurorehabil Hosp, Venice, Italy
[3] CNR, Inst Cognit Sci & Technol, Rome, Italy
Source
FRONTIERS IN PSYCHOLOGY | 2013 / Vol. 4
Keywords
neural networks; connectionist modeling; deep learning; hierarchical generative models; unsupervised learning; visual word recognition; PROBABILISTIC MODELS; CONNECTIONIST; REPRESENTATIONS; ORGANIZATION; RECOGNITION; PRINCIPLES; EMERGENCE; ALGORITHM; DYSLEXIA; WORDS;
DOI
10.3389/fpsyg.2013.00515
Chinese Library Classification number
B84 [Psychology];
Discipline classification code
04 ; 0402 ;
Abstract
Deep unsupervised learning in stochastic recurrent neural networks with many layers of hidden units is a recent breakthrough in neural computation research. These networks build a hierarchy of progressively more complex distributed representations of the sensory data by fitting a hierarchical generative model. In this article we discuss the theoretical foundations of this approach and we review key issues related to training, testing and analysis of deep networks for modeling language and cognitive processing. The classic letter and word perception problem of McClelland and Rumelhart (1981) is used as a tutorial example to illustrate how structured and abstract representations may emerge from deep generative learning. We argue that the focus on deep architectures and generative (rather than discriminative) learning represents a crucial step forward for the connectionist modeling enterprise, because it offers a more plausible model of cortical learning as well as a way to bridge the gap between emergentist connectionist models and structured Bayesian models of cognition.
Pages: 14
Related papers
94 in total
  • [1] ACKLEY DH, 1985, COGNITIVE SCI, V9, P147
  • [2] Albert A., 1972, Regression and the Moore-Penrose pseudoinverse
  • [3] An introduction to MCMC for machine learning
    Andrieu, C
    de Freitas, N
    Doucet, A
    Jordan, MI
    [J]. MACHINE LEARNING, 2003, 50 (1-2) : 5 - 43
  • [4] [Anonymous], INT C MACH LEARN ED
  • [5] [Anonymous], COGN SCI IN PRESS
  • [6] [Anonymous], 2000, Computational Explorations in Cognitive Neuroscience: Understanding the Mind by Simulating the Brain
  • [7] [Anonymous], 1986, Parallel Distributed Processing: Explorations in the Microstructure of Cognition
  • [8] [Anonymous], 2006, NeurIPS
  • [9] [Anonymous], EUR S ART NEUR NETW
  • [10] [Anonymous], 2008, ADV NEURAL INFORM PR