Discriminative restricted Boltzmann machine (DRBM) is a probabilistic threelayered neural network, consisting of the input, hidden, and output layers, that helps to solve classification problems. This study attempts to improve the generalization property of the DRBM. Regularization methods such as L-1 or L-2 regularizations can be used to control the representation power of a learning model and suppress over-fitting to a dataset. To control the representation power of the DRBM, an alternative regularization approach is proposed, in which sparse regularization is imposed on the values of the hidden variables of the DRBM. In the resultant model, the sparse regularization controls the effective size of the hidden layer of the DRBM. Unlike standard regularization methods, in the proposed model, parameters that control the sparsity strength are trainable. The method is validated through numerical experiments based on benchmark datasets.
引用
收藏
页码:207 / 214
页数:8
相关论文
共 14 条
[11]
Smolensky P., 1986, Information Processing in Dynamical Systems: Foundations of Harmony Theory