THE LIMITATIONS OF DETERMINISTIC BOLTZMANN MACHINE LEARNING

Cited by: 20
Author
GALLAND, CC
DOI
10.1088/0954-898X/4/3/007
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The stochastic Boltzmann machine (SBM) learning procedure allows a system of stochastic binary units at thermal equilibrium to model arbitrary probabilistic distributions of binary vectors, but the inefficiency inherent in stochastic simulations limits its usefulness. By employing mean field theory, the stochastic settling to thermal equilibrium can be replaced by efficient deterministic settling to a steady state. The analogous deterministic Boltzmann machine (DBM) learning rule performs steepest descent in an appropriately defined error measure under certain circumstances and has been empirically shown to solve a variety of non-trivial supervised, input-output problems. However, by applying 'naive' mean field theory to a finite system with non-random interactions, the true stochastic system is not well described, and representational problems result that significantly limit the situations in which the DBM procedure can be successfully applied. It is shown that the independence assumption is unacceptably inaccurate in multiple hidden layer configurations, thus accounting for the empirically observed failure of DBM learning in such networks. Further restrictions in network architecture are suggested that maximize the utility of the supervised DBM procedure, but its inherent limitations are shown to be quite severe. An analogous system based on the TAP equations is also discussed.
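The contrast the abstract draws between deterministic mean-field settling and the true stochastic system can be illustrated on a network small enough to enumerate. The following is a minimal sketch, not the paper's procedure: it assumes units taking values in {-1, +1}, a symmetric weight matrix with zero diagonal, the naive mean-field update m_i = tanh((b_i + sum_j w_ij m_j) / T), and a damping factor chosen only for stable iteration. The function names `mean_field_settle` and `exact_means` are hypothetical.

```python
import itertools
import math


def mean_field_settle(weights, biases, temperature=1.0, steps=200, damping=0.5):
    """Naive mean-field settling (assumed form): iterate
    m_i <- tanh((b_i + sum_j w_ij * m_j) / T) until a steady state."""
    n = len(biases)
    m = [0.0] * n  # start from the uninformative state
    for _ in range(steps):
        for i in range(n):
            field = biases[i] + sum(weights[i][j] * m[j] for j in range(n) if j != i)
            target = math.tanh(field / temperature)
            # Damping is an illustrative choice to avoid oscillation.
            m[i] = damping * m[i] + (1.0 - damping) * target
    return m


def exact_means(weights, biases, temperature=1.0):
    """Exact equilibrium means <s_i> by enumerating all 2^n states of the
    Boltzmann distribution p(s) proportional to exp(-E(s) / T)."""
    n = len(biases)
    partition = 0.0
    means = [0.0] * n
    for state in itertools.product((-1, 1), repeat=n):
        energy = -sum(biases[i] * state[i] for i in range(n))
        energy -= sum(
            weights[i][j] * state[i] * state[j]
            for i in range(n)
            for j in range(i + 1, n)
        )
        p = math.exp(-energy / temperature)
        partition += p
        for i in range(n):
            means[i] += p * state[i]
    return [value / partition for value in means]


if __name__ == "__main__":
    # Two strongly coupled units with a small bias (illustrative values).
    w = [[0.0, 1.0], [1.0, 0.0]]
    b = [0.1, 0.1]
    print("mean field:", mean_field_settle(w, b))
    print("exact:     ", exact_means(w, b))
```

With zero weights the two functions agree, since the units really are independent. With coupled units the mean-field steady state can differ noticeably from the exact means, which is the kind of discrepancy the independence assumption introduces in a finite system.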
Pages: 355-379
Page count: 25
Related papers
21 entries in total
[1]  
[Anonymous], 1986, PARALLEL DISTRIBUTED
[2]   1ST-ORDER AND 2ND-ORDER METHODS FOR LEARNING - BETWEEN STEEPEST DESCENT AND NEWTON METHOD [J].
BATTITI, R .
NEURAL COMPUTATION, 1992, 4 (02) :141-166
[3]   EVIDENCE FOR MASSLESS MODES IN THE SOLVABLE MODEL OF A SPIN GLASS [J].
BRAY, AJ ;
MOORE, MA .
JOURNAL OF PHYSICS C-SOLID STATE PHYSICS, 1979, 12 (11) :L441-L448
[4]   DIGITAL DYNAMICS AND THE SIMULATION OF MAGNETIC SYSTEMS [J].
CHOI, MY ;
HUBERMAN, BA .
PHYSICAL REVIEW B, 1983, 28 (05) :2547-2554
[5]  
GALLAND C, 1992, THESIS U TORONTO
[6]  
GALLAND CC, 1993, UNPUB IEEE T NEURAL
[7]  
GALLAND CC, 1989, CRGTR896 U TOR DEP C
[8]   TIME-DEPENDENT STATISTICS OF ISING MODEL [J].
GLAUBER, RJ .
JOURNAL OF MATHEMATICAL PHYSICS, 1963, 4 (02) :294-&
[9]  
Hinton G.E., 1986, P 8 ANN C COGN SCI S, V1, P12
[10]   LEARNING ALGORITHMS AND PROBABILITY-DISTRIBUTIONS IN FEEDFORWARD AND FEEDBACK NETWORKS [J].
HOPFIELD, JJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (23) :8429-8433