Nearest-Neighbor Sampling Based Conditional Independence Testing

被引:0
|
作者
Li, Shuai [1 ]
Chen, Ziqi [1 ]
Zhu, Hongtu [2 ,3 ,4 ,5 ]
Wang, Christina Dan [6 ]
Wen, Wang [7 ]
机构
[1] East China Normal Univ, Sch Stat, KLATASDS MOE, Shanghai, Peoples R China
[2] Univ N Carolina, Dept Biostat, Chapel Hill, NC USA
[3] Univ N Carolina, Dept Stat, Chapel Hill, NC USA
[4] Univ N Carolina, Dept Comp Sci, Chapel Hill, NC USA
[5] Univ N Carolina, Dept Genet, Chapel Hill, NC USA
[6] New York Univ Shanghai, Business Div, Shanghai, Peoples R China
[7] Cent South Univ, Sch Math & Stat, Changsha, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The conditional randomization test (CRT) was recently proposed to test whether two random variables X and Y are conditionally independent given random variables Z. The CRT assumes that the conditional distribution of X given Z is known under the null hypothesis and then it is compared to the distribution of the observed samples of the original data. The aim of this paper is to develop a novel alternative of CRT by using nearest-neighbor sampling without assuming the exact form of the distribution of X given Z. Specifically, we utilize the computationally efficient 1-nearest-neighbor to approximate the conditional distribution that encodes the null hypothesis. Then, theoretically, we show that the distribution of the generated samples is very close to the true conditional distribution in terms of total variation distance. Furthermore, we take the classifier-based conditional mutual information estimator as our test statistic. The test statistic as an empirical fundamental information theoretic quantity is able to well capture the conditional-dependence feature. We show that our proposed test is computationally very fast, while controlling type I and II errors quite well. Finally, we demonstrate the efficiency of our proposed test in both synthetic and real data analyses.
引用
收藏
页码:8631 / 8639
页数:9
相关论文
共 50 条
  • [1] Conditional independence testing based on a nearest-neighbor estimator of conditional mutual information
    Runge, Jakob
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [2] K-Nearest-Neighbor Local Sampling Based Conditional Independence Testing
    Li, Shuai
    Zhang, Yingjie
    Zhu, Hongtu
    Wang, Christina Dan
    Shu, Hai
    Chen, Ziqi
    Sun, Zhuoran
    Yang, Yanfeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] KERNEL AND NEAREST-NEIGHBOR ESTIMATION OF A CONDITIONAL QUANTILE
    BHATTACHARYA, PK
    GANGOPADHYAY, AK
    ANNALS OF STATISTICS, 1990, 18 (03): : 1400 - 1415
  • [4] Topological Nearest-Neighbor Filtering for Sampling-Based Planners
    Sandstrom, Read
    Bregger, Andrew
    Smith, Ben
    Thomas, Shawna
    Amato, Nancy M.
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 3053 - 3060
  • [5] ON NEAREST-NEIGHBOR GRAPHS
    PATERSON, MS
    YAO, FF
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 623 : 416 - 426
  • [6] On Nearest-Neighbor Graphs
    D. Eppstein
    M. S. Paterson
    F. F. Yao
    Discrete & Computational Geometry, 1997, 17 : 263 - 282
  • [7] Nearest-neighbor methods
    Sutton, Clifton
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (03): : 307 - 309
  • [8] On nearest-neighbor graphs
    Eppstein, D
    Paterson, MS
    Yao, FF
    DISCRETE & COMPUTATIONAL GEOMETRY, 1997, 17 (03) : 263 - 282
  • [9] Comparison of sampling methods for estimation of nearest-neighbor index values
    Mauro, Francisco
    Haxtema, Zane
    Temesgen, Hailemariam
    CANADIAN JOURNAL OF FOREST RESEARCH, 2017, 47 (06) : 703 - 715
  • [10] A CONDITIONAL NEAREST-NEIGHBOR SPATIAL-ASSOCIATION MEASURE FOR THE ANALYSIS OF CONDITIONAL LOCATIONAL INTERDEPENDENCE
    OKABE, A
    MIKI, F
    ENVIRONMENT AND PLANNING A, 1984, 16 (02) : 163 - 171