Self-organising Neural Discrete Representation Learning a la Kohonen

被引:0
|
作者
Irie, Kazuki [1 ,6 ]
Csordas, Robert [2 ,6 ]
Schmidhuber, Juergen [3 ,4 ,5 ]
机构
[1] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA
[2] Stanford Univ, Stanford, CA 94305 USA
[3] USI, Swiss AI Lab, IDSIA, Lugano, Switzerland
[4] SUPSI, Lugano, Switzerland
[5] King Abdullah Univ Sci & Technol KAUST, AI Initiat, Thuwal, Saudi Arabia
[6] IDSIA, Lugano, Switzerland
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT I | 2024年 / 15016卷
基金
瑞士国家科学基金会;
关键词
self-organizing maps; Kohonen maps; vector quantisation; VQ-VAE; discrete representation learning; ORGANIZATION; CELLS;
D O I
10.1007/978-3-031-72332-2_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised learning of discrete representations in neural networks (NNs) from continuous ones is essential for many modern applications. Vector Quantisation (VQ) has become popular for this, in particular in the context of generative models, such as Variational Auto-Encoders (VAEs), where the exponential moving average-based VQ (EMA-VQ) algorithm is often used. Here, we study an alternative VQ algorithm based on Kohonen's learning rule for the Self-Organising Map (KSOM; 1982). EMA-VQ is a special case of KSOM. KSOM is known to offer two potential benefits: empirically, it converges faster than EMA-VQ, and KSOM-generated discrete representations form a topological structure on the grid whose nodes are the discrete symbols, resulting in an artificial version of the brain's topographic map. We revisit these properties by using KSOM in VQ-VAEs for image processing. In our experiments, the speed-up compared to well-configured EMA-VQ is only observable at the beginning of training, but KSOM is generally much more robust, e.g., w.r.t. the choice of initialisation schemes (Our code is public: https://github.com/IDSIA/kohonen-vae. The full version with an appendix can be found at: https://arxiv.org/abs/2302.07950).
引用
收藏
页码:343 / 362
页数:20
相关论文
共 47 条
  • [1] Measuring the international digital divide: an application of Kohonen self-organising maps
    Deichmann, Joel I.
    Eshghi, Abdolreza
    Haughton, Dominique
    Woolford, Sam
    INTERNATIONAL JOURNAL OF KNOWLEDGE AND LEARNING, 2007, 3 (06) : 552 - 575
  • [2] Biotic analogies for self-organising cities
    Narraway, Claire L.
    Davis, Oliver S. P.
    Lowell, Sally
    Lythgoe, Katrina A.
    Turner, J. Scott
    Marshall, Stephen
    ENVIRONMENT AND PLANNING B-URBAN ANALYTICS AND CITY SCIENCE, 2020, 47 (02) : 268 - 286
  • [3] Learning New Motion Primitives in the Mirror Neuron System: A Self-organising Computational Model
    Thill, Serge
    Ziemke, Tom
    FROM ANIMALS TO ANIMATS 11, 2010, 6226 : 413 - 423
  • [4] Challenges in stakeholders self-organising to enhance disaster communication
    Le Roux, Tanya
    Van Niekerk, Dewald
    CORPORATE COMMUNICATIONS, 2020, 25 (01) : 128 - 142
  • [5] Modified self-organising maps with a new topology and initialisation algorithm
    Mohebi, Ehsan
    Bagirov, Adil
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2015, 27 (03) : 351 - 372
  • [6] On self-organising mechanisms from social, business and economic domains
    LIRIS-CNRS, University of Lyon, France
    不详
    不详
    不详
    Inf, 2006, 1 (63-71):
  • [7] Rainfall downscaling of weekly ensemble forecasts using self-organising maps
    Ohba, Masamichi
    Kadokura, Shinji
    Nohara, Daisuke
    Toyoda, Yasushi
    TELLUS SERIES A-DYNAMIC METEOROLOGY AND OCEANOGRAPHY, 2016, 68
  • [8] Specifying Self-organising Logistics System: Openness, Intelligence, and Decentralised Control
    Pan, Shenle
    Trentesaux, Damien
    Sallez, Yves
    SERVICE ORIENTATION IN HOLONIC AND MULTI-AGENT MANUFACTURING, 2017, 694 : 93 - 102
  • [9] On Wires and Cables: Content Analysis of WikiLeaks Using Self-Organising Maps
    Mayer, Rudolf
    Rauber, Andreas
    ADVANCES IN SELF-ORGANIZING MAPS, WSOM 2011, 2011, 6731 : 238 - 246
  • [10] Self-Organising Maps for Classification with Metropolis-Hastings Algorithm for Supervision
    Plonski, Piotr
    Zaremba, Krzysztof
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 149 - 156