Robust rank screening for ultrahigh dimensional discriminant analysis

被引:11
作者
Cheng, Guosheng [1 ]
Li, Xingxiang [1 ]
Lai, Peng [1 ]
Song, Fengli [1 ]
Yu, Jun [2 ]
机构
[1] Nanjing Univ Informat Sci Technol, Sch Math & Stat, Nanjing 210044, Peoples R China
[2] Univ Vermont, Dept Math & Stat, Burlington, VT 05401 USA
基金
中国国家自然科学基金;
关键词
Feature screening; Robust property of rank; Sure screening property; Ultrahigh dimensional discriminant analysis; FISHERS LINEAR DISCRIMINANT; GENE-EXPRESSION; VARIABLES; CANCER; MODELS;
D O I
10.1007/s11222-016-9637-2
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we consider sure independence feature screening for ultrahigh dimensional discriminant analysis. We propose a new method named robust rank screening based on the conditional expectation of the rank of predictor's samples. We also establish the sure screening property for the proposed procedure under simple assumptions. The new procedure has some additional desirable characters. First, it is robust against heavy-tailed distributions, potential outliers and the sample shortage for some categories. Second, it is model-free without any specification of a regression model and directly applicable to the situation with many categories. Third, it is simple in theoretical derivation due to the boundedness of the resulting statistics. Forth, it is relatively inexpensive in computational cost because of the simple structure of the screening index. Monte Carlo simulations and real data examples are used to demonstrate the finite sample performance.
引用
收藏
页码:535 / 545
页数:11
相关论文
共 22 条
[1]  
Barrett T, 2005, NUCLEIC ACIDS RES, V33, pD562
[2]   Some theory for Fisher's linear discriminant function, 'naive Bayes', and some alternatives when there are many more variables than observations [J].
Bickel, PJ ;
Levina, E .
BERNOULLI, 2004, 10 (06) :989-1010
[3]   Sparse Discriminant Analysis [J].
Clemmensen, Line ;
Hastie, Trevor ;
Witten, Daniela ;
Ersboll, Bjarne .
TECHNOMETRICS, 2011, 53 (04) :406-413
[4]  
Cui H., 2014, J AM STAT ASS
[5]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[6]   Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models [J].
Fan, Jianqing ;
Feng, Yang ;
Song, Rui .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :544-557
[7]   SURE INDEPENDENCE SCREENING IN GENERALIZED LINEAR MODELS WITH NP-DIMENSIONALITY [J].
Fan, Jianqing ;
Song, Rui .
ANNALS OF STATISTICS, 2010, 38 (06) :3567-3604
[8]  
Fan JQ, 2009, J MACH LEARN RES, V10, P2013
[9]   HIGH-DIMENSIONAL CLASSIFICATION USING FEATURES ANNEALED INDEPENDENCE RULES [J].
Fan, Jianqing ;
Fan, Yingying .
ANNALS OF STATISTICS, 2008, 36 (06) :2605-2637
[10]  
Gordon GJ, 2002, CANCER RES, V62, P4963