Semiparametric estimation of a two-component mixture model where one component is known

被引:44
作者
Bordes, Laurent
Delmas, Celine
Vandekerkhove, Pierre
机构
[1] Univ Marne Vallee, LAMA, F-77454 Champs Sur Marne 2, Marne La Vallee, France
[2] Univ Technol Compiegne, LMAC, F-60206 Compiegne, France
关键词
identifiability; microarray data; mixture; multiple test hypothesis; sentiparametric; training data;
D O I
10.1111/j.1467-9469.2006.00515.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a two-component mixture model where one component distribution is known while the mixing proportion and the other component distribution are unknown. These kinds of models were first introduced in biology to study the differences in expression between genes. The various estimation methods proposed till now have all assumed that the unknown distribution belongs to a parametric family. In this paper, we show how this assumption can be relaxed. First, we note that generally the above model is not identifiable, but we show that under moment and symmetry conditions some 'almost everywhere' identifiability results can be obtained. Where such identifiability conditions are fulfilled we propose an estimation method for the unknown parameters which is shown to be strongly consistent under mild conditions. We discuss applications of our method to microarray data analysis and to the training data problem. We compare our method to the parametric approach using simulated data and, finally, we apply our method to real data from microarray experiments.
引用
收藏
页码:733 / 752
页数:20
相关论文
共 17 条
[1]  
[Anonymous], 2000, WILEY SERIES PROBABI
[2]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[3]  
BORDES L, 2006, IN PRESS ANN STAT, V34
[4]  
Bosq D., 1987, THEORIE ESTIMATION F
[5]   Nonparametric estimation in semi-parametric univariate mixture models [J].
Cruz-Medina, IR ;
Hettmansperger, TP .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2004, 74 (07) :513-524
[6]   THE EQUIVALENCE OF WEAK, STRONG AND COMPLETE CONVERGENCE IN L1 FOR KERNEL DENSITY ESTIMATES [J].
DEVROYE, L .
ANNALS OF STATISTICS, 1983, 11 (03) :896-904
[7]  
Dudoit S, 2002, STAT SINICA, V12, P111
[8]   Correction of density estimators that are not densities [J].
Glad, IK ;
Hjort, NL ;
Ushakov, NG .
SCANDINAVIAN JOURNAL OF STATISTICS, 2003, 30 (02) :415-427
[9]  
HALL P, 1981, J ROY STAT SOC B MET, V43, P147
[10]  
Hall P, 2003, ANN STAT, V31, P201