The concordance filter: an adaptive model-free feature screening procedure

被引:1
作者
Cheng, Xuewei [1 ,2 ,3 ]
Li, Gang [4 ]
Wang, Hong [3 ]
机构
[1] Hunan Normal Univ, Sch Math & Stat, MOE LCSM, Changsha, Peoples R China
[2] Hunan Normal Univ, Coll Hunan Prov, Key Lab Appl Stat & Data Sci, Changsha, Peoples R China
[3] Cent South Univ, Sch Math & Stat, Changsha, Peoples R China
[4] Univ Calif Los Angeles, Sch Publ Hlth, Los Angeles, CA USA
关键词
Concordance filter; Sure independent screening; High-dimensional data; Model-free; Data-adaptive; REGRESSION; SELECTION; RANK; INEQUALITIES;
D O I
10.1007/s00180-023-01399-5
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A new model-free and data-adaptive feature screening procedure referred to as the concordance filter is developed for ultrahigh-dimensional data. The proposed method is based on the concordance filter which measures concordance between random vectors and can work adaptively with several types of predictors and response variables. We apply the concordance filter to deal with feature screening problems emerging from a wide range of real applications, such as nonparametric regression and survival analysis, among others. It is shown that the concordance filter enjoys the sure screening and rank consistency properties under weak regularity conditions. In particular, the concordance filter can still be powerful in the presence of censoring and heavy tails. We further demonstrate the superior performance of the concordance filter over existing screening methods by numerical examples and medical applications.
引用
收藏
页码:2413 / 2436
页数:24
相关论文
共 44 条
[1]   A KNOCKOFF FILTER FOR HIGH-DIMENSIONAL SELECTIVE INFERENCE [J].
Barber, Rina Foygel ;
Candes, Emmanuel J. .
ANNALS OF STATISTICS, 2019, 47 (05) :2504-2537
[2]   CONTROLLING THE FALSE DISCOVERY RATE VIA KNOCKOFFS [J].
Barber, Rina Foygel ;
Candes, Emmanuel J. .
ANNALS OF STATISTICS, 2015, 43 (05) :2055-2085
[3]   ADAPTIVE ESTIMATION OF THE RANK OF THE COEFFICIENT MATRIX IN HIGH-DIMENSIONAL MULTIVARIATE RESPONSE REGRESSION MODELS [J].
Bing, Xin ;
Wegkamp, Marten H. .
ANNALS OF STATISTICS, 2019, 47 (06) :3157-3184
[4]   Using the accelerated failure time model to analyze current status data with misclassified covariates [J].
Chen, Baojiang ;
Qin, Jing ;
Yuan, Ao .
ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (01) :1372-1394
[5]   MULTIVARIATE GENERALIZATIONS OF THE PROPORTIONAL HAZARDS MODEL [J].
CLAYTON, D ;
CUZICK, J .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1985, 148 :82-117
[6]  
COX DR, 1972, J R STAT SOC B, V34, P187
[7]   Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the TRANSBIG multicenter independent validation series [J].
Desmedt, Christine ;
Piette, Fanny ;
Loi, Sherene ;
Wang, Yixin ;
d'assignies, Mahasti Saghatchian ;
Bergh, Jonas ;
Lidereau, Rosette ;
Ellis, Paul ;
Harris, Adrian L. ;
Klijn, Jan G. M. ;
Foekens, John A. ;
Cardoso, Fatima ;
Piccart, Martine J. ;
Buyse, Marc ;
Sotiriou, Christos .
CLINICAL CANCER RESEARCH, 2007, 13 (11) :3207-3214
[8]  
Fan J., 2020, Statistical Foundations of Data Science
[9]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[10]   Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models [J].
Fan, Jianqing ;
Feng, Yang ;
Song, Rui .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :544-557