Discriminative restricted Boltzmann machine with trainable sparsity

被引:0
作者
Yasuda, Muneki [1 ]
Katsumata, Tomu [2 ]
机构
[1] Yamagata Univ, Grad Sch Sci & Engn, 4-3-16 Jyounan, Yonezawa, Yamagata 9928510, Japan
[2] KADOKAWA Connected Inc, Integrated Data Serv Dept, 2-13-3 Fujimi,Chiyoda Ku, Tokyo 1028177, Japan
来源
IEICE NONLINEAR THEORY AND ITS APPLICATIONS | 2023年 / 14卷 / 02期
关键词
classification; statistical machine learning; discriminative restricted Boltzmann machine; trainable sparse regularization;
D O I
10.1587/nolta.14.207
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Discriminative restricted Boltzmann machine (DRBM) is a probabilistic threelayered neural network, consisting of the input, hidden, and output layers, that helps to solve classification problems. This study attempts to improve the generalization property of the DRBM. Regularization methods such as L-1 or L-2 regularizations can be used to control the representation power of a learning model and suppress over-fitting to a dataset. To control the representation power of the DRBM, an alternative regularization approach is proposed, in which sparse regularization is imposed on the values of the hidden variables of the DRBM. In the resultant model, the sparse regularization controls the effective size of the hidden layer of the DRBM. Unlike standard regularization methods, in the proposed model, parameters that control the sparsity strength are trainable. The method is validated through numerical experiments based on benchmark datasets.
引用
收藏
页码:207 / 214
页数:8
相关论文
共 14 条
  • [1] Bishop C. M, 2006, PATTERN RECOGN, DOI [10.1007/978-0-387-45528-0, DOI 10.1007/978-0-387-45528-0]
  • [2] Glorot X., 2010, P 13 INT C ART INT S, P249, DOI DOI 10.1109/LGRS.2016.2565705
  • [3] Training products of experts by minimizing contrastive divergence
    Hinton, GE
    [J]. NEURAL COMPUTATION, 2002, 14 (08) : 1771 - 1800
  • [4] Classifying a high resolution image of an urban area using super-object information
    Johnson, Brian
    Xie, Zhixiao
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2013, 83 : 40 - 49
  • [5] Multi-layered Discriminative Restricted Boltzmann Machine with Untrained Probabilistic Layer
    Kanno, Yuri
    Yasuda, Muneki
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7655 - 7660
  • [6] Kingma D. P., 2014, arXiv
  • [7] Larochelle H., 2008, P 25 INT C MACHINE L, P536
  • [8] Larochelle H, 2012, J MACH LEARN RES, V13, P643
  • [9] Rish I., 2014, Sparse Modeling: Theory, Algorithms, and Applications
  • [10] Salakhutdinov R., 2009, Artificial intelligence and statistics, P448