Discriminative restricted Boltzmann machine with trainable sparsity

被引:0
作者
Yasuda, Muneki [1 ]
Katsumata, Tomu [2 ]
机构
[1] Yamagata Univ, Grad Sch Sci & Engn, 4-3-16 Jyounan, Yonezawa, Yamagata 9928510, Japan
[2] KADOKAWA Connected Inc, Integrated Data Serv Dept, 2-13-3 Fujimi,Chiyoda Ku, Tokyo 1028177, Japan
来源
IEICE NONLINEAR THEORY AND ITS APPLICATIONS | 2023年 / 14卷 / 02期
关键词
classification; statistical machine learning; discriminative restricted Boltzmann machine; trainable sparse regularization;
D O I
10.1587/nolta.14.207
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Discriminative restricted Boltzmann machine (DRBM) is a probabilistic threelayered neural network, consisting of the input, hidden, and output layers, that helps to solve classification problems. This study attempts to improve the generalization property of the DRBM. Regularization methods such as L-1 or L-2 regularizations can be used to control the representation power of a learning model and suppress over-fitting to a dataset. To control the representation power of the DRBM, an alternative regularization approach is proposed, in which sparse regularization is imposed on the values of the hidden variables of the DRBM. In the resultant model, the sparse regularization controls the effective size of the hidden layer of the DRBM. Unlike standard regularization methods, in the proposed model, parameters that control the sparsity strength are trainable. The method is validated through numerical experiments based on benchmark datasets.
引用
收藏
页码:207 / 214
页数:8
相关论文
共 14 条
  • [11] Smolensky P., 1986, Information Processing in Dynamical Systems: Foundations of Harmony Theory
  • [12] A New Sparse Restricted Boltzmann Machine
    Wei, Jiangshu
    Lv, Jiancheng
    Yi, Zhang
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (10)
  • [13] Yasuda M., 2022, P 2022 INT S NONL TH
  • [14] Restricted Boltzmann Machine with Multivalued Hidden Variables A Model Suppressing Over-Fitting
    Yokoyama, Yuuki
    Katsumata, Tomu
    Yasuda, Muneki
    [J]. REVIEW OF SOCIONETWORK STRATEGIES, 2019, 13 (02) : 253 - 266