Nonparametric additive model with grouped lasso and maximizing area under the ROC curve

Cited by: 3
Authors
Choi, Sungwoo [1]
Park, Junyong [1]
Affiliations
[1] Univ Maryland Baltimore Cty, Dept Math & Stat, Baltimore, MD 21250 USA
Keywords
ROC curve; AUC; Additive model; Variable selection; High dimension; WAVELET APPROXIMATIONS; MICROARRAY DATA; REGRESSION; SELECTION; REGULARIZATION; CLASSIFICATION
DOI
10.1016/j.csda.2014.03.010
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
An ROC (Receiver Operating Characteristic) curve is a popular tool for classifying two populations. A nonparametric additive model is used to construct a classifier, which is estimated by maximizing the U-statistic type of empirical AUC (Area Under the Curve). In particular, a sparse setting is considered, in which only a small number of variables are significant for classification, so the many noisy variables need to be removed. A theoretical result on the necessity of variable selection under the sparsity condition is provided, since the AUC of the classifier obtained by maximizing the empirical AUC is not guaranteed to be optimal. To select the significant variables, the grouped lasso is used; it has been widely used when groups of parameters need to be selected or discarded simultaneously. In addition, the performance of the proposed method is evaluated in numerical studies, including simulations and real data examples, and compared with other existing approaches. (C) 2014 Elsevier B.V. All rights reserved.
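The two quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `empirical_auc` and `group_lasso_penalty` are hypothetical, ties are assumed to count as one half (the usual Mann-Whitney convention), and the penalty is assumed to be the unweighted sum of group norms.

```python
import math


def empirical_auc(scores_pos, scores_neg):
    """U-statistic form of the empirical AUC.

    Averages, over all (positive, negative) pairs, the indicator that the
    positive sample receives the higher classifier score.
    Assumption: a tied pair contributes 1/2.
    """
    total = 0.0
    for sp in scores_pos:
        for sn in scores_neg:
            if sp > sn:
                total += 1.0
            elif sp == sn:
                total += 0.5
    return total / (len(scores_pos) * len(scores_neg))


def group_lasso_penalty(groups, lam):
    """Grouped lasso penalty: lam times the sum of Euclidean norms of the groups.

    Each group holds the coefficients belonging to one variable (for example,
    the basis coefficients of one additive component), so a group is either
    kept or shrunk to zero as a whole.
    Assumption: no per-group weights.
    """
    return lam * sum(math.sqrt(sum(b * b for b in g)) for g in groups)


# Toy scores, invented for illustration only.
pos = [0.9, 0.8, 0.4]
neg = [0.7, 0.3, 0.4]
auc = empirical_auc(pos, neg)

# Two coefficient groups; the second is entirely zero (a discarded variable).
penalty = group_lasso_penalty([[3.0, 4.0], [0.0, 0.0]], lam=0.5)
```

A penalized criterion of the kind the abstract describes would trade the first quantity off against the second; because the indicator-based AUC is not differentiable, a smooth surrogate would typically stand in for it during optimization, and the choice of surrogate is not specified here.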
Pages: 313-325
Page count: 13