Probabilistic Symmetries and Invariant Neural Networks

被引:0
作者
Bloem-Reddy, Benjamin [1 ]
Teh, Yee Whye [2 ]
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z4, Canada
[2] Univ Oxford, Dept Stat, Oxford OX1 3LB, England
基金
欧洲研究理事会;
关键词
probabilistic symmetry; convolutional neural networks; exchangeability; neural architectures; invariance; equivariance; sufficiency; adequacy; graph neural networks; THEOREM; MODELS; DISTRIBUTIONS; GENERALIZE; GRAPHS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Treating neural network inputs and outputs as random variables, we characterize the structure of neural networks that can be used to model data that are invariant or equivariant under the action of a compact group. Much recent research has been devoted to encoding invariance under symmetry transformations into neural network architectures, in an effort to improve the performance of deep neural networks in data-scarce, non-i.i.d., or unsupervised settings. By considering group invariance from the perspective of probabilistic symmetry, we establish a link between functional and probabilistic symmetry, and obtain generative functional representations of probability distributions that are invariant or equivariant under the action of a compact group. Our representations completely characterize the structure of neural networks that can be used to model such distributions and yield a general program for constructing invariant stochastic or deterministic neural networks. We demonstrate that examples from the recent literature are special cases, and develop the details of the general program for exchangeable sequences and arrays.
引用
收藏
页数:61
相关论文
共 130 条
[1]  
Abadi M., 2015, TENSORFLOW LARGESCAL
[2]   DISTRIBUTIONS OF MAXIMAL INVARIANTS USING QUOTIENT MEASURES [J].
ANDERSSON, S .
ANNALS OF STATISTICS, 1982, 10 (03) :955-961
[3]  
[Anonymous], ICML
[4]  
[Anonymous], P 2017 ACM C EC COMP
[5]  
[Anonymous], 1988, P AMS CENTENNIAL S
[6]  
[Anonymous], 2018, INT C MACH LEARN
[7]  
[Anonymous], 2016, Advances in Neural Information Processing Systems
[8]  
[Anonymous], NEURIPS WORKSH BAYES
[9]  
[Anonymous], ADV NEURAL INFORM PR
[10]  
[Anonymous], 1988, PERCEPTRONS EXPANDED