Conditional quantile screening in ultrahigh-dimensional heterogeneous data

被引:92
作者
Wu, Yuanshan [1 ]
Yin, Guosheng [2 ]
机构
[1] Wuhan Univ, Sch Math & Stat, Wuhan 430072, Hubei, Peoples R China
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Censored data; Feature screening; Heterogeneity; Quantile regression; Sure independent screening; Transformation model; Ultrahigh dimensionality; Variable selection; VARIABLE SELECTION; MODELS;
D O I
10.1093/biomet/asu068
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
To accommodate the heterogeneity that is often present in ultrahigh-dimensional data, we propose a conditional quantile screening method, which enables us to select features that contribute to the conditional quantile of the response given the covariates. The method can naturally handle censored data by incorporating a weighting scheme through redistribution of the mass to the right; moreover, it is invariant to monotone transformation of the response and requires substantially weaker conditions than do alternative methods. We establish sure independent screening properties for both the complete and the censored response cases. We also conduct simulations to evaluate the finite-sample performance of the proposed method, and compare it with existing approaches.
引用
收藏
页码:65 / 76
页数:12
相关论文
共 22 条
[1]  
[Anonymous], 2012, J AM STAT ASSOC
[2]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[3]   Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models [J].
Fan, Jianqing ;
Feng, Yang ;
Song, Rui .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :544-557
[4]   SURE INDEPENDENCE SCREENING IN GENERALIZED LINEAR MODELS WITH NP-DIMENSIONALITY [J].
Fan, Jianqing ;
Song, Rui .
ANNALS OF STATISTICS, 2010, 38 (06) :3567-3604
[5]  
Fan JQ, 2009, J MACH LEARN RES, V10, P2013
[6]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[7]   Independent screening for single-index hazard rate models with ultrahigh dimensional features [J].
Gorst-Rasmussen, Anders ;
Scheike, Thomas .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2013, 75 (02) :217-245
[8]   A lack-of-fit test for quantile regression [J].
He, XM ;
Zhu, LX .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2003, 98 (464) :1013-1022
[9]   QUANTILE-ADAPTIVE MODEL-FREE VARIABLE SCREENING FOR HIGH-DIMENSIONAL HETEROGENEOUS DATA [J].
He, Xuming ;
Wang, Lan ;
Hong, Hyokyoung Grace .
ANNALS OF STATISTICS, 2013, 41 (01) :342-369
[10]   A consistent multivariate test of association based on ranks of distances [J].
Heller, Ruth ;
Heller, Yair ;
Gorfine, Malka .
BIOMETRIKA, 2013, 100 (02) :503-510