Least-squares independence regression for non-linear causal inference under non-Gaussian noise

被引:0
作者
Makoto Yamada
Masashi Sugiyama
Jun Sese
机构
[1] 701 1st Ave.,
来源
Machine Learning | 2014年 / 96卷
关键词
Causal inference; Non-linear; Non-Gaussian; Squared-loss mutual information; Least-squares independence regression;
D O I
暂无
中图分类号
学科分类号
摘要
The discovery of non-linear causal relationship under additive non-Gaussian noise models has attracted considerable attention recently because of their high flexibility. In this paper, we propose a novel causal inference algorithm called least-squares independence regression (LSIR). LSIR learns the additive noise model through the minimization of an estimator of the squared-loss mutual information between inputs and residuals. A notable advantage of LSIR is that tuning parameters such as the kernel width and the regularization parameter can be naturally optimized by cross-validation, allowing us to avoid overfitting in a data-dependent fashion. Through experiments with real-world datasets, we show that LSIR compares favorably with a state-of-the-art causal inference method.
引用
收藏
页码:249 / 267
页数:18
相关论文
共 34 条
[1]  
Aronszajn N.(1950)Theory of reproducing kernels Transactions of the American Mathematical Society 68 337-404
[2]  
Faith J. J.(2007)Large-scale mapping and validation of PLoS Biology 5 419-433
[3]  
Hayete B.(1993) transcriptional regulation from a compendium of expression profiles International Statistical Review 61 1871-1905
[4]  
Thaden J. T.(2009)A consistent test for bivariate dependence The Annals of Statistics 37 189-212
[5]  
Mogno I.(2010)Kernel dimension reduction in regression Open Systems & Information Dynamics 17 335-367
[6]  
Wierzbowski J.(2012)Justifying additive noise model-based causal discovery via algorithmic information theory Machine Learning 86 79-86
[7]  
Cottarel G.(2004)Statistical analysis of kernel-based least-squares density-ratio estimation Physical Review E 69 503-528
[8]  
Kasif S.(1951)Estimating mutual information The Annals of Mathematical Statistics 22 157-175
[9]  
Collins J. J.(1989)On information and sufficiency Mathematical Programming Series B 45 2003-2030
[10]  
Gardner T. S.(1900)On the limited memory method for large scale optimization Philosophical Magazine 50 67-93