DIVIDE AND CONQUER IN NONSTANDARD PROBLEMS AND THE SUPER-EFFICIENCY PHENOMENON

被引:51
作者
Banerjee, Moulinath [1 ]
Durot, Cecile [2 ]
Sen, Bodhisattva [3 ]
机构
[1] Univ Michigan, Dept Stat, 451 West Hall,1085 South Univ, Ann Arbor, MI 48109 USA
[2] Univ Paris Nanterre, Dept Math Informat, 200 Ave Republ, F-92000 Nanterre, France
[3] Columbia Univ, Dept Stat, 1255 Amsterdam Ave,Room 1032 Ssw, New York, NY 10027 USA
关键词
Cube-root asymptotics; isotonic regression; local minimax risk; non-Gaussian limit; sample-splitting; MONOTONE; REGRESSION;
D O I
10.1214/17-AOS1633
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study how the divide and conquer principle works in non-standard problems where rates of convergence are typically slower than root n and limit distributions are non-Gaussian, and provide a detailed treatment for a variety of important and well-studied problems involving nonparametric estimation of a monotone function. We find that for a fixed model, the pooled estimator, obtained by averaging nonstandard estimates across mutually exclusive subsamples, outperforms the nonstandard monotonicity-constrained (global) estimator based on the entire sample in the sense of pointwise estimation of the function. We also show that, under appropriate conditions, if the number of subsamples is allowed to increase at appropriate rates, the pooled estimator is asymptotically normally distributed with a variance that is empirically estimable from the subsample-level estimates. Further, in the context of monotone regression, we show that this gain in efficiency under a fixed model comes at a price-the pooled estimator's performance, in a uniform sense (maximal risk) over a class of models worsens as the number of subsamples increases, leading to a version of the super-efficiency phenomenon. In the process, we develop analytical results for the order of the bias in isotonic regression, which are of independent interest.
引用
收藏
页码:720 / 757
页数:38
相关论文
共 31 条
[1]  
[Anonymous], 1956, Scandinavian Actuarial Journal
[2]   Likelihood ratio tests under local and fixed alternatives in monotone function problems [J].
Banerjee, M .
SCANDINAVIAN JOURNAL OF STATISTICS, 2005, 32 (04) :507-525
[3]   Confidence intervals for current status data [J].
Banerjee, M ;
Wellner, JA .
SCANDINAVIAN JOURNAL OF STATISTICS, 2005, 32 (03) :405-424
[4]  
BANERJEE M., 2019, DIVIDE CONQUER NON S, DOI [10.1214/17-AOS1633SUPP, DOI 10.1214/17-AOS1633SUPP]
[5]  
Banerjee M, 2008, STAT SINICA, V18, P467
[6]   Confidence sets for split points in decision trees [J].
Banerjee, Moulinath ;
Mckeague, Ian W. .
ANNALS OF STATISTICS, 2007, 35 (02) :543-574
[7]   Likelihood based inference for monotone response models [J].
Banerjee, Moulinath .
ANNALS OF STATISTICS, 2007, 35 (03) :931-956
[8]  
Brown LD, 1997, ANN STAT, V25, P2607
[9]   MAXIMUM LIKELIHOOD ESTIMATES OF MONOTONE PARAMETERS [J].
BRUNK, HD .
ANNALS OF MATHEMATICAL STATISTICS, 1955, 26 (04) :607-616
[10]  
Brunk HD, 1970, NONPARAMETRIC TECHNI, P195