On the implicit acquisition of a context-free grammar by a simple recurrent neural network

被引:11
作者
Cartling, Bo [1 ]
机构
[1] Royal Inst Technol, AlbaNova Univ Ctr, Dept Theoret Phys, SE-10691 Stockholm, Sweden
关键词
language acquisition; context-free grammar; simple recurrent network; internal representation; generalization capacity;
D O I
10.1016/j.neucom.2007.05.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of a simple recurrent neural network on the implicit acquisition of a context-free grammar is re-examined and found to be significantly higher than previously reported by Elman. This result is obtained although the previous work employed it multilayer extension of the basic form of simple recurrent network and restricted the complexity of training and test corpora. The high performance is traced to a well-organized internal representation of the grammatical elements, as probed by a principal-component analysis of the hidden-layer activities. From the next-symbol-prediction performance on sentences not present in the training corpus, it capacity of generalization is demonstrated. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1527 / 1537
页数:11
相关论文
共 39 条
[1]  
Abeles M., 1991, Corticonics: Neural circuits of the cerebral cortex, DOI DOI 10.1017/CBO9780511574566
[2]   Context-free and context-sensitive dynamics in recurrent neural networks [J].
Bodén, M ;
Wiles, J .
CONNECTION SCIENCE, 2000, 12 (3-4) :197-210
[3]   The dynamics of discrete-time computation, with application to recurrent neural networks and finite state machine extraction [J].
Casey, M .
NEURAL COMPUTATION, 1996, 8 (06) :1135-1178
[4]  
Chomsky N., 1957, SYNTACTIC STRUCTURES, DOI 10.1515/9783112316009
[5]  
CHOMSKY N, 1962, 65 MIT RES LAB EL Q, V65, P187
[6]  
Chomsky Noam, 1959, Infromation and Control, V2, P137, DOI 10.1016/S0019-9958(59)90362-6
[7]  
Christiansen M. H., 1994, Mind and Language, V9, P273, DOI 10.1111/j.1468-0017.1994.tb00226.x
[8]  
Christiansen MH, 1999, COGNITIVE SCI, V23, P157, DOI 10.1207/s15516709cog2302_2
[9]   Finite State Automata and Simple Recurrent Networks [J].
Cleeremans, Axel ;
Servan-Schreiber, David ;
McClelland, James L. .
NEURAL COMPUTATION, 1989, 1 (03) :372-381
[10]   FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211