Affiliation:
Univ London London Sch Econ & Polit Sci, Dept Stat, London WC2A 2AE, England
Cho, Haeran [1]
Fryzlewicz, Piotr [1]
Affiliation:
[1] Univ London London Sch Econ & Polit Sci, Dept Stat, London WC2A 2AE, England
Adaptivity;
Correlation;
Hard thresholding;
High dimensionality;
Linear regression;
Variable selection;
NONCONCAVE PENALIZED LIKELIHOOD;
MODEL SELECTION;
REGRESSION;
LASSO;
CLASSIFICATION;
DISCOVERY;
DOI: 10.1111/j.1467-9868.2011.01023.x
Chinese Library Classification:
O21 [Probability theory and mathematical statistics];
C8 [Statistics];
Discipline classification codes:
020208;
070103;
0714;
Abstract:
The paper considers variable selection in linear regression models where the number of covariates is possibly much larger than the number of observations. High dimensionality of the data brings in many complications, such as (possibly spurious) high correlations between the variables, which result in marginal correlation being unreliable as a measure of association between the variables and the response. We propose a new way of measuring the contribution of each variable to the response which takes into account high correlations between the variables in a data-driven way. The proposed tilting procedure provides an adaptive choice between the use of marginal correlation and tilted correlation for each variable, where the choice is made depending on the values of the hard thresholded sample correlation of the design matrix. We study the conditions under which this measure can successfully discriminate between the relevant and the irrelevant variables and thus be used as a tool for variable selection. Finally, an iterative variable screening algorithm is constructed to exploit the theoretical properties of tilted correlation, and its good practical performance is demonstrated in a comparative simulation study.
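The adaptive choice described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `tilted_scores`, the `threshold` parameter, and the rescaling (a partial correlation after projecting out the strongly correlated columns) are all assumptions made for illustration, and the paper's own rescaling and its iterative screening algorithm may differ.

```python
import numpy as np

def tilted_scores(X, y, threshold):
    """Illustrative per-variable association scores (hypothetical helper).

    For each column j, the hard-thresholded sample correlation matrix decides
    whether to use the marginal correlation (no strongly correlated columns)
    or a "tilted" one (strongly correlated columns are projected out first).
    """
    n, p = X.shape
    # Centre and scale to unit norm so that inner products are correlations.
    Xs = X - X.mean(axis=0)
    Xs = Xs / np.linalg.norm(Xs, axis=0)
    yc = y - y.mean()
    yc = yc / np.linalg.norm(yc)

    corr = Xs.T @ Xs                    # sample correlation of the design
    np.fill_diagonal(corr, 0.0)
    strong = np.abs(corr) > threshold   # hard thresholding of the entries

    scores = np.empty(p)
    for j in range(p):
        idx = np.flatnonzero(strong[j])
        if idx.size == 0:
            # No column is highly correlated with X_j: marginal correlation.
            scores[j] = Xs[:, j] @ yc
            continue
        # Project X_j and y onto the orthogonal complement of span(X_idx).
        Q, _ = np.linalg.qr(Xs[:, idx])
        xj_t = Xs[:, j] - Q @ (Q.T @ Xs[:, j])
        y_t = yc - Q @ (Q.T @ yc)
        # Assumed rescaling: correlation of the two residual vectors.
        denom = np.linalg.norm(xj_t) * np.linalg.norm(y_t)
        scores[j] = (xj_t @ y_t) / denom if denom > 1e-12 else 0.0
    return scores
```

In this sketch a variable that merely shadows a relevant one (high marginal correlation with the response only through a correlated neighbour) receives a small score once the neighbour is projected out, which is the behaviour the abstract attributes to tilting.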