Non-parametric Inference Adaptive to Intrinsic Dimension

被引:0
|
作者
Khosravi, Khashayar [1 ]
Lewis, Greg [2 ]
Syrgkanis, Vasilis [2 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Microsoft Res, Redmond, WA USA
关键词
non-parametric statistics; inference; intrinsic dimension; conditional moment equation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider non-parametric estimation and inference of conditional moment models in high dimensions. Formally, we consider the problem of finding a parameter vector theta(x) is an element of R-p that is a solution to a set of conditional moment equations of the form: E[psi(Z; theta(x))vertical bar X = x] = 0, (1) when given n i.i.d. samples (Z(1),..., Z(n)) from the distribution of Z, where psi : Z x R-p -> R-p is a known vector valued moment function, Z is an arbitrary data space, X is an element of X subset of R-D is the feature vector that is included Z. We show that even when the dimension D of the conditioning variable is larger than the sample size n, estimation and inference is feasible as long as the distribution of the conditioning variable has small intrinsic dimension d, as measured by locally low doubling measures. Our estimation is based on a sub-sampled ensemble of the k-nearest neighbors (k-NN) Z-estimator. our estimator solves a locally weighted empirical conditional moment equation (theta) over cap (x) solves : Sigma(n)(i=1) K(x, X-i, S) psi(Z(i); theta) = 0, (2) where K(x, X-i, S) is a kernel capturing the proximity of X-i to the target point x. We consider weights K(x, X-i, S) that take the form of an average over B base weights: K(x, X-i, S) = 1/B Sigma(B)(b=1) K(x, X-i, S-b) 1{i is an element of S-b}, where each K(x, X-i, S-b) is calculated based on a randomly drawn sub-sample S-b of size s < n from the original sample. We show that if the intrinsic dimension of the covariate distribution is equal to d, then the finite sample estimation error of our estimator is of order n(-1/(d+2)) and our estimate is n(-1/(d+2)) - asymptotically normal, irrespective of D. The sub-sampling size required for achieving these results depends on the unknown intrinsic dimension d. We propose an adaptive data-driven approach for choosing this parameter and prove that it achieves the desired rates. We discuss extensions and applications to heterogeneous treatment effect estimation.
引用
收藏
页数:1
相关论文
共 50 条
  • [1] Non-parametric inference on the number of equilibria
    Kasy, Maximilian
    ECONOMETRICS JOURNAL, 2015, 18 (01): : 1 - 39
  • [2] Non-Parametric Inference of Relational Dependence
    Ahsan, Ragib
    Fatemi, Zahra
    Arbour, David
    Zheleva, Elena
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 54 - 63
  • [3] Statistical inference in the non-parametric case
    Scheffe, H
    ANNALS OF MATHEMATICAL STATISTICS, 1943, 14 : 305 - 332
  • [4] Non-parametric inference for density modes
    Genovese, Christopher R.
    Perone-Pacifico, Marco
    Verdinelli, Isabella
    Wasserman, Larry
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2016, 78 (01) : 99 - 126
  • [5] Non-parametric inference for balanced randomization designs
    Rukhin, Andrew L.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2007, 137 (03) : 967 - 984
  • [6] Non-parametric Inference and Coordination for Distributed Robotics
    Julian, Brian J.
    Angermann, Michael
    Rus, Daniela
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 2787 - 2794
  • [7] Non-parametric inference on calibration of predicted risks
    Sadatsafavi, Mohsen
    Petkau, John
    STATISTICS IN MEDICINE, 2024, 43 (18) : 3524 - 3538
  • [8] NON-PARAMETRIC STATISTICAL INFERENCE FOR THE SURVIVAL EXPERIMENTS
    Ramadurai, M.
    Basha, M. A. Ghouse
    JP JOURNAL OF BIOSTATISTICS, 2021, 18 (03) : 379 - 394
  • [9] Non-parametric Bayesian inference on bivariate extremes
    Guillotte, Simon
    Perron, Francois
    Segers, Johan
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2011, 73 : 377 - 406
  • [10] Parametric and non-parametric gradient matching for network inference: a comparison
    Dony, Leander
    He, Fei
    Stumpf, Michael P. H.
    BMC BIOINFORMATICS, 2019, 20 (1)