Autoencoding slow representations for semi-supervised data-efficient regression

被引:1
作者
Struckmeier, Oliver [1 ]
Tiwari, Kshitij [2 ]
Kyrki, Ville [1 ]
机构
[1] Aalto Univ, Intelligent Robot, Maarintie 8, Espoo 02150, Finland
[2] Univ Oulu, Percept Engn Grp, Erkki Koiso Kanttilan Katu 3, Oulu 90014, Finland
关键词
Unsupervised representation learning; Slowness principle; Data-efficient downstream tasks;
D O I
10.1007/s10994-022-06299-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The slowness principle is a concept inspired by the visual cortex of the brain. It postu-lates that the underlying generative factors of a quickly varying sensory signal change on a different, slower time scale. By applying this principle to state-of-the-art unsupervised representation learning methods one can learn a latent embedding to perform supervised downstream regression tasks more data efficient. In this paper, we compare different approaches to unsupervised slow representation learning such as Lp norm based slowness regularization and the SlowVAE, and propose a new term based on Brownian motion used in our method, the S-VAE. We empirically evaluate these slowness regularization terms with respect to their downstream task performance and data efficiency in state estimation and behavioral cloning tasks. We find that slow representations show great performance improvements in settings where only sparse labeled training data is available. Furthermore, we present a theoretical and empirical comparison of the discussed slowness regulariza-tion terms. Finally, we discuss how the Frechet Inception Distance (FID), commonly used to determine the generative capabilities of GANs, can predict the performance of trained models in supervised downstream tasks.
引用
收藏
页码:2297 / 2315
页数:19
相关论文
共 31 条
[1]  
Alemi A. A., 2016, ARXIV
[2]  
[Anonymous], 2009, Advances in neural information processing systems
[3]  
Barratt S., 2018, arXiv
[4]  
Beattie C., 2016, CoRR
[5]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[6]   Slow feature analysis yields a rich repertoire of complex cell properties [J].
Berkes, P ;
Wiskott, L .
JOURNAL OF VISION, 2005, 5 (06) :579-602
[7]  
Burgess C., 2018, ARXIV
[8]   Learning Intermediate-Level Representations of Form and Motion from Natural Movies [J].
Cadieu, Charles F. ;
Olshausen, Bruno A. .
NEURAL COMPUTATION, 2012, 24 (04) :827-866
[9]  
Chen Ting, 2020, ICML
[10]  
Fortuin V, 2020, PR MACH LEARN RES, V108, P1651