BS-SIM: An effective variable selection method for high-dimensional single index model

被引:8
作者
Cheng, Longjie [1 ]
Zeng, Peng [2 ]
Zhu, Yu [1 ,3 ]
机构
[1] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
[2] Auburn Univ, Dept Math & Stat, Auburn, AL 36849 USA
[3] Tsinghua Unvers, Dept Ind Engn, Ctr Stat Sci, Beijing, Peoples R China
来源
ELECTRONIC JOURNAL OF STATISTICS | 2017年 / 11卷 / 02期
基金
美国国家科学基金会;
关键词
Single index model; variable selection; regression spline; LASSO; SICA; TUNING PARAMETER SELECTION; PENALIZED LIKELIHOOD; REGRESSION SPLINES; ORACLE PROPERTIES; KNOT SELECTION; LEAST-SQUARES; LASSO; MELANOMA; PROGRESSION; SHRINKAGE;
D O I
10.1214/17-EJS1329
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The single index model is an intuitive extension of the linear regression model. It has become increasingly popular due to its flexibility in modeling. Similar to the linear regression model, the set of predictors for the single index model can contain a large number of irrelevant variables. Therefore, it is important to select the relevant variables when fitting the single index model. However, the problem of variable selection for high-dimensional single index model is not well settled in the literature. In this work, we combine the idea of applying cubic B-splines for estimating the single index model with the idea of using the family of the smooth integration of counting and absolute deviation (SICA) penalty functions for variable selection. We propose a new method to simultaneously perform parameter estimation and model selection for the single index model. This method is referred to as the B-spline and SICA method for the single index model, or in short, BS-SIM. We develop a coordinate descent algorithm to efficiently implement BS-SIM. We also show that under certain conditions, the proposed method can consistently estimate the true index and select the true model. Simulations with various settings and a real data analysis are conducted to demonstrate the estimation accuracy, selection consistency and computational efficiency of BS-SIM.
引用
收藏
页码:3522 / 3548
页数:27
相关论文
共 32 条
  • [1] Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
  • [2] Cheng L., 2017, SUPPLEMENTARY MAT BS, DOI [10.1214/17-EJS1329SUPP, DOI 10.1214/17-EJS1329SUPP]
  • [3] DE BOOR C, 2001, PRACTICAL GUIDE SPLI
  • [4] Variable selection via nonconcave penalized likelihood and its oracle properties
    Fan, JQ
    Li, RZ
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) : 1348 - 1360
  • [5] Tuning parameter selection in high dimensional penalized likelihood
    Fan, Yingying
    Tang, Cheng Yong
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2013, 75 (03) : 531 - 552
  • [6] INVESTIGATING SMOOTH MULTIPLE-REGRESSION BY THE METHOD OF AVERAGE DERIVATIVES
    HARDLE, W
    STOKER, TM
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1989, 84 (408) : 986 - 995
  • [7] Variable selection for the single-index model
    Kong, Efang
    Xia, Yingcun
    [J]. BIOMETRIKA, 2007, 94 (01) : 217 - 229
  • [8] Notch1 signaling promotes primary melanoma progression by activating mitogen-activated protein kinase/phosphatidylinositol 3-kinase-Akt pathways and up-regulating N-cadherin expression
    Liu, ZJ
    Xiao, M
    Balint, K
    Smalley, KSM
    Brafford, P
    Qiu, RH
    Pinnix, CC
    Li, XL
    Herlyn, M
    [J]. CANCER RESEARCH, 2006, 66 (08) : 4182 - 4190
  • [9] Restoring p53 Function in Human Melanoma Cells by Inhibiting MDM2 and Cyclin B1/CDK1-Phosphorylated Nuclear iASPP
    Lu, Min
    Breyssens, Hilde
    Salter, Victoria
    Zhong, Shan
    Hu, Ying
    Baer, Caroline
    Ratnayaka, Indrika
    Sullivan, Alex
    Brown, Nicholas R.
    Endicott, Jane
    Knapp, Stefan
    Kessler, Benedikt M.
    Middleton, Mark R.
    Siebold, Christian
    Jones, E. Yvonne
    Sviderskaya, Elena V.
    Cebon, Jonathan
    John, Thomas
    Caballero, Otavia L.
    Goding, Colin R.
    Lu, Xin
    [J]. CANCER CELL, 2013, 23 (05) : 618 - 633
  • [10] A UNIFIED APPROACH TO MODEL SELECTION AND SPARSE RECOVERY USING REGULARIZED LEAST SQUARES
    Lv, Jinchi
    Fan, Yingying
    [J]. ANNALS OF STATISTICS, 2009, 37 (6A) : 3498 - 3528