THE LIMITATIONS OF DETERMINISTIC BOLTZMANN MACHINE LEARNING

Cited by: 20

Authors:

GALLAND, CC

Institution:

Source:

NETWORK-COMPUTATION IN NEURAL SYSTEMS | 1993 / Vol. 4 / No. 3

Keywords:

DOI:

10.1088/0954-898X/4/3/007

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The stochastic Boltzmann machine (SBM) learning procedure allows a system of stochastic binary units at thermal equilibrium to model arbitrary probabilistic distributions of binary vectors, but the inefficiency inherent in stochastic simulations limits its usefulness. By employing mean field theory, the stochastic settling to thermal equilibrium can be replaced by efficient deterministic settling to a steady state. The analogous deterministic Boltzmann machine (DBM) learning rule performs steepest descent in an appropriately defined error measure under certain circumstances and has been empirically shown to solve a variety of non-trivial supervised, input-output problems. However, by applying 'naive' mean field theory to a finite system with non-random interactions, the true stochastic system is not well described, and representational problems result that significantly limit the situations in which the DBM procedure can be successfully applied. It is shown that the independence assumption is unacceptably inaccurate in multiple hidden layer configurations, thus accounting for the empirically observed failure of DBM learning in such networks. Further restrictions in network architecture are suggested that maximize the utility of the supervised DBM procedure, but its inherent limitations are shown to be quite severe. An analogous system based on the TAP equations is also discussed.
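The gap the abstract describes, between deterministic mean field settling and the true stochastic system, can be illustrated on a network small enough to enumerate exactly. The sketch below is not taken from the paper. It assumes +/-1 units, symmetric weights with zero diagonal, and the energy E = -sum_{i<j} w_ij s_i s_j - sum_i b_i s_i. The function names (`mean_field_settle`, `exact_statistics`) and the three-unit weights and biases are hypothetical, chosen only for illustration. It settles the naive mean field equations m_i = tanh((b_i + sum_j w_ij m_j) / T) and compares the result with the exact Boltzmann averages.

```python
import itertools
import math


def mean_field_settle(weights, biases, temperature=1.0, sweeps=200, damping=0.5):
    """Damped fixed-point iteration of m_i = tanh((b_i + sum_j w_ij m_j) / T).

    Assumes +/-1 units and a symmetric weight matrix (illustrative convention).
    """
    n = len(biases)
    m = [0.0] * n
    for _ in range(sweeps):
        for i in range(n):
            field = biases[i] + sum(weights[i][j] * m[j] for j in range(n) if j != i)
            target = math.tanh(field / temperature)
            # Damping mixes the old and new value to avoid oscillation.
            m[i] = (1.0 - damping) * m[i] + damping * target
    return m


def exact_statistics(weights, biases, temperature=1.0):
    """Exact means <s_i> and correlations <s_i s_j> by enumerating all 2^n states."""
    n = len(biases)
    z = 0.0
    means = [0.0] * n
    corr = [[0.0] * n for _ in range(n)]
    for state in itertools.product((-1, 1), repeat=n):
        energy = -sum(biases[i] * state[i] for i in range(n))
        energy -= sum(
            weights[i][j] * state[i] * state[j]
            for i in range(n)
            for j in range(i + 1, n)
        )
        p = math.exp(-energy / temperature)  # unnormalized Boltzmann weight
        z += p
        for i in range(n):
            means[i] += p * state[i]
            for j in range(n):
                corr[i][j] += p * state[i] * state[j]
    means = [x / z for x in means]
    corr = [[c / z for c in row] for row in corr]
    return means, corr


# Hypothetical toy network: three mutually excitatory units with a small bias.
weights = [[0.0, 1.0, 1.0], [1.0, 0.0, 1.0], [1.0, 1.0, 0.0]]
biases = [0.1, 0.1, 0.1]

mf = mean_field_settle(weights, biases)
means, corr = exact_statistics(weights, biases)

print("mean field m_i:       ", mf)
print("exact <s_i>:          ", means)
print("exact <s_0 s_1>:      ", corr[0][1])
print("independent <s_0><s_1>:", means[0] * means[1])
```

Under the independence assumption, <s_0 s_1> would equal <s_0><s_1>. In this toy network the exact correlation is clearly larger than that product, and the mean field activities overshoot the exact means, which is the kind of mismatch for a finite system with non-random interactions that the abstract refers to.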


Pages: 355-379

Page count: 25
