ON THE PROBLEM OF LOCAL MINIMA IN RECURRENT NEURAL NETWORKS

Cited by: 34
Authors
BIANCHINI, M
GORI, M
MAGGINI, M
Affiliation
[1] Dipartimento di Sistemi e Informatica, Universita di Firenze, 50139, Firenze
Source
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1994 / Vol. 5 / No. 2
DOI
10.1109/72.279182
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Many researchers have recently focused their efforts on devising efficient algorithms, mainly based on optimization schemes, for learning the weights of recurrent neural networks. As in the case of feedforward networks, however, these learning algorithms may get stuck in local minima during gradient descent, thus discovering sub-optimal solutions. This paper analyses the problem of optimal learning in recurrent networks by proposing conditions that guarantee local-minima-free error surfaces. An example is given that also shows the constructive role of the proposed theory in designing networks suitable for solving a given task. Moreover, a formal relationship between recurrent and static feedforward networks is established, such that the examples of local minima for feedforward networks already known in the literature can be associated with analogous ones in recurrent networks.
Pages: 167 - 172
Number of pages: 6
Related papers
34 in total
  • [1] ALMEIDA LB, 1987, 1ST IEEE ANN INT C N
  • [3] BENGIO Y, 1993, MAR P IEEE ICNN93 SA, P1183
  • [4] BIANCHINI M, 1993, DSI292 U FIR TECHN R
  • [5] Approximation of Boolean Functions by Sigmoidal Networks: Part I: XOR and Other Two-Variable Functions
    Blum, E. K.
    [J]. NEURAL COMPUTATION, 1989, 1 (04) : 532 - 540
  • [6] BACK PROPAGATION FAILS TO SEPARATE WHERE PERCEPTRONS SUCCEED
    BRADY, ML
    RAGHAVAN, R
    SLAWNY, J
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (05) : 665 - 674
  • [7] CETIN BC, 1993, MAR P IEEE ICNN 93 S, V2, P836
  • [8] Finite State Automata and Simple Recurrent Networks
    Cleeremans, Axel
    Servan-Schreiber, David
    McClelland, James L.
    [J]. NEURAL COMPUTATION, 1989, 1 (03) : 372 - 381
  • [9] PHONETICALLY-BASED MULTILAYERED NEURAL NETWORKS FOR VOWEL CLASSIFICATION
    COSI, P
    BENGIO, Y
    DEMORI, R
    [J]. SPEECH COMMUNICATION, 1990, 9 (01) : 15 - 29
  • [10] FINDING STRUCTURE IN TIME
    ELMAN, JL
    [J]. COGNITIVE SCIENCE, 1990, 14 (02) : 179 - 211