Local polynomial regression with selection biased data

被引:0
作者
Wu, CO [1 ]
机构
[1] Johns Hopkins Univ, GWC Whiting Sch Engn, Dept Math Sci, Baltimore, MD 21218 USA
关键词
cross-validation; local polynomials; nonparametric maximum likelihood estimator; optimal kernel and bandwidths; selection-biased sample;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Let Y and X be real- and Rd-valued random variables. We consider the estimation of the nonparametric regression function m(x) = E(Y/X = x) when s greater than or equal to 1 independent selection-biased samples of (Y,X) are observed. This sampling scheme, which arises naturally in biological and epidemiological studies and many other fields, includes stratified samples, length-biased samples and other weighted distributions. A class of local polynomial estimators of m(x) is derived by smoothing Vardi's nonparametric maximum likelihood estimator of the underlying distribution function. Large sample properties, such as asymptotic distributions and asymptotic mean squared risks, are derived explicitly Unlike local polynomial regression with i.i.d. direct samples, we show here that kernel choices are important and optimal kernel functions may be asymmetric and discontinuous when the weight functions of the biased samples have jumps. A cross-validation criterion is proposed for the selection of data-driven bandwidths. Through a simple comparison, we show that our estimators are superior to other intuitive estimators of m(x).
引用
收藏
页码:789 / 817
页数:29
相关论文
共 32 条
[1]   ON MULTIVARIATE KERNEL ESTIMATION FOR SAMPLES FROM WEIGHTED DISTRIBUTIONS [J].
AHMAD, IA .
STATISTICS & PROBABILITY LETTERS, 1995, 22 (02) :121-129
[2]  
[Anonymous], ENCY STAT SCI
[3]  
[Anonymous], 1989, STAT DATA ANAL INFER, DOI DOI 10.1016/B978-0-444-88029-1.50035-6
[4]   LARGE SAMPLE THEORY OF ESTIMATION IN BIASED SAMPLING REGRESSION-MODELS .1. [J].
BICKEL, PJ ;
RITOV, J .
ANNALS OF STATISTICS, 1991, 19 (02) :797-816
[5]  
Cheng MY, 1997, ANN STAT, V25, P1691
[6]   SOME STABILIZED BANDWIDTH SELECTORS FOR NONPARAMETRIC REGRESSION [J].
CHIU, ST .
ANNALS OF STATISTICS, 1991, 19 (03) :1528-1546
[7]  
Eubank R.L., 1988, SPLINE SMOOTHING NON
[8]  
Fan J., 1996, Local Polynomial Modelling and Its Applications: Monographs on Statistics and Applied Probability
[9]  
FAN JQ, 1995, J ROY STAT SOC B, V57, P371
[10]   Local polynomial regression: Optimal kernels and asymptotic minimax efficiency [J].
Fan, JQ ;
Gasser, T ;
Gijbels, I ;
Brockmann, M ;
Engel, J .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1997, 49 (01) :79-99