Segmentation of the mean of heteroscedastic data via cross-validation

被引：0

作者：

Sylvain Arlot

Alain Celisse

机构：

[1] Willow Project-Team,

[2] Laboratoire d’Informatique de l’Ecole Normale Superieure,undefined

[3] CNRS/ENS/INRIA UMR 8548,undefined

[4] Laboratoire de Mathématique Paul Painlevé UMR 8524 CNRS,undefined

[5] Université Lille 1,undefined

来源：

Statistics and Computing | 2011年 / 21卷

关键词：

Change-point detection; Resampling; Cross-validation; Model selection; Heteroscedastic data; CGH profile segmentation;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper tackles the problem of detecting abrupt changes in the mean of a heteroscedastic signal by model selection, without knowledge on the variations of the noise. A new family of change-point detection procedures is proposed, showing that cross-validation methods can be successful in the heteroscedastic framework, whereas most existing procedures are not robust to heteroscedasticity. The robustness to heteroscedasticity of the proposed procedures is supported by an extensive simulation study, together with recent partial theoretical results. An application to Comparative Genomic Hybridization (CGH) data is provided, showing that robustness to heteroscedasticity can indeed be required for their analysis.

引用

页码：613 / 632

页数：19

共 76 条

[1] Abramovich F.(2006)Adapting to unknown sparsity by controlling the false discovery rate Ann. Stat. 34 584-653
[2] Benjamini Y.(1970)Statistical predictor identification Ann. Inst. Stat. Math. 22 203-217
[3] Donoho D.L.(1974)The relationship between variable selection and data augmentation and a method for prediction Technometrics 16 125-127
[4] Johnstone I.M.(2009)Model selection by resampling penalization Electron. J. Stat. 3 557-624
[5] Akaike H.(2010)A survey of cross-validation procedures for model selection Stat. Surv. 4 40-79
[6] Allen D.M.(2009)Data-driven calibration of penalties for least-squares regression J. Mach. Learn. Res. 10 245-279
[7] Arlot S.(2000)Model selection for regression on a fixed design Probab. Theory Relat. Fields 117 467-493
[8] Arlot S.(2002)Model selection for regression on a random design ESAIM Probab. Stat. 6 127-146
[9] Celisse A.(2009)Gaussian model selection with an unknown variance Ann. Stat. 37 630-672
[10] Arlot S.(1999)Risk bounds for model selection via penalization Probab. Theory Relat. Fields 113 301-413

← 1 2 3 4 5 6 7 8 →