A nonparametric feature screening method for ultrahigh-dimensional missing response

Cited by: 9
Authors
Li, Xiaoxia [1, 2]
Tang, Niansheng [1, 2]
Xie, Jinhan [1, 2]
Yan, Xiaodong [1, 2]
Affiliations
[1] Yunnan Univ, Yunnan Key Lab Stat Modeling & Data Anal, Kunming 650500, Yunnan, Peoples R China
[2] Shandong Univ, Sch Econ, Jinan 250100, Shandong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature screening; Imputation; Marginal Spearman rank correlation; Missing at random; Ultrahigh-dimensional data; VARIABLE SELECTION; KOLMOGOROV FILTER; MODEL SELECTION; LINEAR-MODELS; LIKELIHOOD; SURVIVAL;
DOI
10.1016/j.csda.2019.106828
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
This paper addresses feature screening for ultrahigh-dimensional data with responses missing at random. A novel nonparametric feature screening procedure is developed to identify the important features via the conditionally imputing marginal Spearman rank correlation. The proposed approach has several desirable merits. First, it is nonparametric: it assumes no regression form relating the predictors to the response variable. Second, it is robust to outliers and heavy-tailed data. Third, under some regularity conditions, the procedure is shown to have the sure screening and ranking consistency properties. Simulation studies show that the proposed screening procedure outperforms several existing model-free screening procedures. An example taken from the microarray diffuse large-B-cell lymphoma study is used to illustrate the proposed methodology. (C) 2019 Elsevier B.V. All rights reserved.
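As a rough illustration of the idea behind marginal Spearman rank correlation screening, the sketch below ranks features by the absolute Spearman correlation with the response, computed on the rows where the response is observed. This is a simplified complete-case stand-in and not the paper's conditional imputation estimator, whose details are not given in this record; complete-case estimates can be biased under missing at random. The function names `rank` and `spearman_screen`, the cutoff `d`, and the simulated data are all assumptions made for illustration.

```python
import numpy as np

def rank(v):
    # Ranks 1..n; assumes no ties, which holds almost surely for continuous data.
    r = np.empty(len(v), dtype=float)
    r[np.argsort(v)] = np.arange(1, len(v) + 1)
    return r

def spearman_screen(X, y, d):
    """Return the d features with the largest |Spearman correlation| with y.

    Rows where y is NaN (missing) are dropped. This is a complete-case
    simplification, not the imputation-based estimator of the paper.
    """
    observed = ~np.isnan(y)
    Xo, yo = X[observed], y[observed]
    ry = rank(yo)
    ry -= ry.mean()
    scores = np.empty(X.shape[1])
    for j in range(X.shape[1]):
        rx = rank(Xo[:, j])
        rx -= rx.mean()
        # Pearson correlation of the ranks equals the Spearman correlation.
        scores[j] = abs(rx @ ry) / np.sqrt((rx @ rx) * (ry @ ry))
    top = np.argsort(scores)[::-1][:d]
    return top, scores

# Hypothetical simulated data: 2 active features out of 1000, heavy-tailed noise.
rng = np.random.default_rng(0)
n, p = 200, 1000
X = rng.standard_normal((n, p))
y = 3 * X[:, 0] - 3 * X[:, 1] + rng.standard_t(df=3, size=n)

# Missing at random: the missingness probability depends only on an observed covariate.
miss = rng.random(n) < 1 / (1 + np.exp(-(X[:, 2] - 1.5)))
y_obs = np.where(miss, np.nan, y)

selected, scores = spearman_screen(X, y_obs, d=20)
```

Because only ranks enter the score, extreme values of the response or the predictors have limited influence, which is the intuition behind the robustness claim in the abstract.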
Pages: 18
Related papers
50 in total
  • [21] Independent feature screening for ultrahigh-dimensional models with interactions
    Song, Yunquan
    Zhu, Xuehu
    Lin, Lu
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2014, 43 (04) : 567 - 583
  • [22] A selective overview of feature screening for ultrahigh-dimensional data
    JingYuan Liu
    Wei Zhong
    RunZe Li
    Science China Mathematics, 2015, 58 : 1 - 22
  • [23] Independent feature screening for ultrahigh-dimensional models with interactions
    Yunquan Song
    Xuehu Zhu
    Lu Lin
    Journal of the Korean Statistical Society, 2014, 43 : 567 - 583
  • [24] Spearman Rank Correlation Screening for Ultrahigh-Dimensional Censored Data
    Wang, Hongni
    Yan, Jingxin
    Yan, Xiaodong
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10104 - 10112
  • [25] A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data
    Xue, Jingnan
    Liang, Faming
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (04) : 803 - 813
  • [26] Interaction Screening for Ultrahigh-Dimensional Data
    Hao, Ning
    Zhang, Hao Helen
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1285 - 1301
  • [27] Quantile-Composited Feature Screening for Ultrahigh-Dimensional Data
    Chen, Shuaishuai
    Lu, Jun
    MATHEMATICS, 2023, 11 (10)
  • [28] FEATURE SCREENING VIA DISTANCE CORRELATION FOR ULTRAHIGH DIMENSIONAL DATA WITH RESPONSES MISSING AT RANDOM
    Xia, Linli
    Tang, Niansheng
    STATISTICA SINICA, 2023, 33 : 1169 - 1191
  • [29] Feature screening based on distance correlation for ultrahigh-dimensional censored data with covariate measurement error
    Chen, Li-Pang
    COMPUTATIONAL STATISTICS, 2021, 36 (02) : 857 - 884
  • [30] Adjusted feature screening for ultra-high dimensional missing response
    Zou, Liying
    Liu, Yi
    Zhang, Zhonghu
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (03) : 460 - 483