Function Classes for Identifiable Nonlinear Independent Component Analysis

Cited by: 0
Authors
Buchholz, Simon [1]
Besserve, Michel [1]
Scholkopf, Bernhard [1]
Affiliations
[1] Max Planck Inst Intelligent Syst, Tubingen, Germany
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022
Keywords
ICA; SEPARATION; UNIQUENESS; VARIABLES; EXISTENCE; EQUATION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Unsupervised learning of latent variable models (LVMs) is widely used to represent data in machine learning. When such models reflect the ground truth factors and the mechanisms mapping them to observations, there is reason to expect that they allow generalization in downstream tasks. It is, however, well known that such identifiability guarantees are typically not achievable without putting constraints on the model class. This is notably the case for nonlinear Independent Component Analysis, in which the LVM maps statistically independent variables to observations via a deterministic nonlinear function. Several families of spurious solutions that fit the data perfectly but do not correspond to the ground truth factors can be constructed in generic settings. However, recent work suggests that constraining the function class of such models may promote identifiability. Specifically, function classes with constraints on their partial derivatives, gathered in the Jacobian matrix, have been proposed, such as orthogonal coordinate transformations (OCT), which impose orthogonality of the Jacobian columns. In the present work, we prove that a subclass of these transformations, conformal maps, is identifiable and provide novel theoretical results suggesting that OCTs have properties that prevent families of spurious solutions from spoiling identifiability in a generic setting.
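As a rough illustration of the two function classes named in the abstract, the sketch below numerically checks the Jacobian conditions on two textbook maps. It is not taken from the paper: the helper names (`jacobian`, `is_oct`, `is_conformal`), the example maps, the evaluation point, and the tolerances are all assumptions chosen for illustration. It treats a map as an OCT at a point when the Gram matrix of its Jacobian is diagonal (orthogonal columns), and as conformal when that Gram matrix is a scalar multiple of the identity (orthogonal columns of equal norm).

```python
import numpy as np

def jacobian(f, z, eps=1e-6):
    """Central finite-difference Jacobian of f at z; column i is df/dz_i."""
    z = np.asarray(z, dtype=float)
    cols = []
    for i in range(z.size):
        step = np.zeros_like(z)
        step[i] = eps
        cols.append((f(z + step) - f(z - step)) / (2 * eps))
    return np.stack(cols, axis=1)

def polar(z):
    """Polar coordinates (r, theta) -> (x, y): orthogonal but unequal columns."""
    r, theta = z
    return np.array([r * np.cos(theta), r * np.sin(theta)])

def complex_exp(z):
    """Complex exponential (x, y) -> e^x (cos y, sin y): a conformal map."""
    x, y = z
    return np.exp(x) * np.array([np.cos(y), np.sin(y)])

def is_oct(J, tol=1e-6):
    """True if the Jacobian columns are pairwise orthogonal (J^T J diagonal)."""
    G = J.T @ J
    off_diagonal = G - np.diag(np.diag(G))
    return bool(np.all(np.abs(off_diagonal) < tol))

def is_conformal(J, tol=1e-6):
    """True if J^T J is a scalar multiple of the identity."""
    G = J.T @ J
    scale = np.trace(G) / G.shape[0]
    return bool(np.all(np.abs(G - scale * np.eye(G.shape[0])) < tol))

# Hypothetical evaluation point, chosen arbitrarily.
point = np.array([2.0, 0.7])
J_polar = jacobian(polar, point)
J_exp = jacobian(complex_exp, point)

print(is_oct(J_polar), is_conformal(J_polar))  # True False
print(is_oct(J_exp), is_conformal(J_exp))      # True True
```

The polar map passes the OCT check but fails the conformal one because its Jacobian columns have norms 1 and r, which is consistent with the abstract's description of conformal maps as a subclass of OCTs. A check at a single point is only a sanity test; the conditions are meant to hold everywhere on the domain.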
Pages: 16