Nonparametric independence testing via mutual information

被引:68
作者
Berrett, T. B. [1 ]
Samworth, R. J. [1 ]
机构
[1] Univ Cambridge, Stat Lab, Wilberforce Rd, Cambridge CB3 0WB, England
基金
英国工程与自然科学研究理事会;
关键词
Entropy estimation; Independence test; Mutual information; Nearest neighbour; COMPONENT ANALYSIS; DEPENDENCE; BOOTSTRAP;
D O I
10.1093/biomet/asz024
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose a test of independence of two multivariate random vectors, given a sample from the underlying population. Our approach is based on the estimation of mutual information, whose decomposition into joint and marginal entropies facilitates the use of recently developed efficient entropy estimators derived from nearest neighbour distances. The proposed critical values may be obtained by simulation in the case where an approximation to one marginal is available or by permuting the data otherwise. This facilitates size guarantees, and we provide local power analyses, uniformly over classes of densities whose mutual information satisfies a lower bound. Our ideas may be extended to provide new goodness-of-fit tests for normal linear models based on assessing the independence of our vector of covariates and an appropriately defined notion of an error vector. The theory is supported by numerical studies on both simulated and real data.
引用
收藏
页码:547 / 566
页数:20
相关论文
共 55 条
[1]   BOOTSTRAP AND PERMUTATION TESTS OF INDEPENDENCE FOR POINT PROCESSES [J].
Albert, Melisande ;
Bouret, Yann ;
Fromont, Magalie ;
Reynaud-Bouret, Patricia .
ANNALS OF STATISTICS, 2015, 43 (06) :2537-2564
[2]  
[Anonymous], 1987, PROBL INFORM TRANSM
[3]  
[Anonymous], 2009, CAUSALITY, DOI DOI 10.1017/CBO9780511803161
[4]  
[Anonymous], ARXIV180705405V2
[5]  
[Anonymous], 1989, Statistical Science
[6]  
[Anonymous], ARXIV150101617V5
[7]  
[Anonymous], ARXIV180407203V2
[8]  
[Anonymous], 2019, R LANGUAGE ENV STAT
[9]  
[Anonymous], SYMRC ESTIMATING SYM
[10]  
[Anonymous], ARXIV161004782