Model-free feature screening for ultrahigh-dimensional data conditional on some variables

被引:0
作者
Yi Liu
Qihua Wang
机构
[1] Chinese Academy of Sciences,Academy of Mathematics and Systems Science
[2] China University of Petroleum,College of Science
[3] Shenzhen University,Institute of Statistical Science
来源
Annals of the Institute of Statistical Mathematics | 2018年 / 70卷
关键词
Conditional distance correlation; Feature selection; Sure screening property; High-dimensional data;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, the conditional distance correlation (CDC) is used as a measure of correlation to develop a conditional feature screening procedure given some significant variables for ultrahigh-dimensional data. The proposed procedure is model free and is called conditional distance correlation-sure independence screening (CDC-SIS for short). That is, we do not specify any model structure between the response and the predictors, which is appealing in some practical problems of ultrahigh-dimensional data analysis. The sure screening property of the CDC-SIS is proved and a simulation study was conducted to evaluate the finite sample performances. Real data analysis is used to illustrate the proposed method. The results indicate that CDC-SIS performs well.
引用
收藏
页码:283 / 301
页数:18
相关论文
empty
未找到相关数据