Disentangling Representations in Restricted Boltzmann Machines without Adversaries

被引:10
作者
Fernandez-de-Cossio-Diaz, Jorge [1 ]
Cocco, Simona [1 ]
Monasson, Remi [1 ]
机构
[1] Sorbonne Univ, Lab Phys Ecole Normale Superieure, CNRS, UMR & PSL Res 8023, Paris, France
关键词
KH DOMAIN;
D O I
10.1103/PhysRevX.13.021003
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
(Received 21 July 2022; revised 16 January 2023; accepted 8 March 2023; published 5 April 2023) A goal of unsupervised machine learning is to build representations of complex high-dimensional data, with simple relations to their properties. Such disentangled representations make it easier to interpret the significant latent factors of variation in the data, as well as to generate new data with desirable features. The methods for disentangling representations often rely on an adversarial scheme, in which representations are tuned to avoid discriminators from being able to reconstruct information about the data properties (labels). Unfortunately, adversarial training is generally difficult to implement in practice. Here we propose a simple, effective way of disentangling representations without any need to train adversarial discriminators and apply our approach to Restricted Boltzmann Machines, one of the simplest representation-based generative models. Our approach relies on the introduction of adequate constraints on the weights during training, which allows us to concentrate information about labels on a small subset of latent variables. The effectiveness of the approach is illustrated with four examples: the CelebA dataset of facial images, the two-dimensional Ising model, the MNIST dataset of handwritten digits, and the taxonomy of protein families. In addition, we show how our framework allows for analytically computing the cost, in terms of the log-likelihood of the data, associated with the disentanglement of their representations.
引用
收藏
页数:24
相关论文
共 62 条
  • [1] Abadir M. K., 2005, Matrix Algebra, V1
  • [2] [Anonymous], 2008, P 25 INT C MACH LEAR
  • [3] aps, US, DOI [10.1103/PhysRevX.13.021003, DOI 10.1103/PHYSREVX.13.021003]
  • [4] Arjovsky M., 2017, arXiv, DOI DOI 10.48550/ARXIV.1701.04862
  • [5] Nonanalytic nonequilibrium field theory: Stochastic reheating of the Ising model
    Aron, Camille
    Kulkarni, Manas
    [J]. PHYSICAL REVIEW RESEARCH, 2020, 2 (04):
  • [6] Landau theory for non-equilibrium steady states
    Aron, Camille
    Chamon, Claudio
    [J]. SCIPOST PHYSICS, 2020, 8 (05):
  • [7] Baxter R. J., 2016, EXACTLY SOLVED MODEL
  • [8] How generic scale invariance influences quantum and classical phase transitions
    Belitz, D
    Kirkpatrick, TR
    Vojta, T
    [J]. REVIEWS OF MODERN PHYSICS, 2005, 77 (02) : 579 - 632
  • [9] Bengio Y., 2012, P ICML WORKSH UNS TR, V7, P19, DOI DOI 10.5555/3045796.3045800
  • [10] RBM-MHC: A Semi-Supervised Machine-Learning Method for Sample-Specific Prediction of Antigen Presentation by HLA-I Alleles
    Bravi, Barbara
    Tubiana, Jerome
    Cocco, Simona
    Monasson, Remi
    Mora, Thierry
    Walczak, Aleksandra M.
    [J]. CELL SYSTEMS, 2021, 12 (02) : 195 - +