Discriminative restricted Boltzmann machine with trainable sparsity

被引：0

作者：

Yasuda, Muneki ^{[1
]}

Katsumata, Tomu ^{[2
]}

机构：

[1] Yamagata Univ, Grad Sch Sci & Engn, 4-3-16 Jyounan, Yonezawa, Yamagata 9928510, Japan

[2] KADOKAWA Connected Inc, Integrated Data Serv Dept, 2-13-3 Fujimi,Chiyoda Ku, Tokyo 1028177, Japan

来源：

IEICE NONLINEAR THEORY AND ITS APPLICATIONS | 2023年 / 14卷 / 02期

关键词：

classification; statistical machine learning; discriminative restricted Boltzmann machine; trainable sparse regularization;

D O I：

10.1587/nolta.14.207

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Discriminative restricted Boltzmann machine (DRBM) is a probabilistic threelayered neural network, consisting of the input, hidden, and output layers, that helps to solve classification problems. This study attempts to improve the generalization property of the DRBM. Regularization methods such as L-1 or L-2 regularizations can be used to control the representation power of a learning model and suppress over-fitting to a dataset. To control the representation power of the DRBM, an alternative regularization approach is proposed, in which sparse regularization is imposed on the values of the hidden variables of the DRBM. In the resultant model, the sparse regularization controls the effective size of the hidden layer of the DRBM. Unlike standard regularization methods, in the proposed model, parameters that control the sparsity strength are trainable. The method is validated through numerical experiments based on benchmark datasets.

引用

页码：207 / 214

页数：8

共 14 条

[11] Smolensky P., 1986, Information Processing in Dynamical Systems: Foundations of Harmony Theory
[12] A New Sparse Restricted Boltzmann Machine
Wei, Jiangshu
Lv, Jiancheng
Yi, Zhang
[J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (10)
[13] Yasuda M., 2022, P 2022 INT S NONL TH
[14] Restricted Boltzmann Machine with Multivalued Hidden Variables A Model Suppressing Over-Fitting
Yokoyama, Yuuki
Katsumata, Tomu
Yasuda, Muneki
[J]. REVIEW OF SOCIONETWORK STRATEGIES, 2019, 13 (02) : 253 - 266

← 1 2 →