Structural regularization in quadratic logistic regression model

Cited by: 10
Authors
Jiang, He [1,2]
Dong, Yao [1,2]
Affiliations
[1] Jiangxi Univ Finance & Econ, Sch Stat, Nanchang 330013, Jiangxi, Peoples R China
[2] Jiangxi Univ Finance & Econ, Appl Stat Res Ctr, Nanchang 330013, Jiangxi, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
Quadratic model; Heredity principle; Classification; C-GRESH; Oracle property; VARIABLE SELECTION; LASSO; ALGORITHM; DIMENSION; TUMOR;
DOI
10.1016/j.knosys.2018.10.012
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In statistical modeling, the quadratic model, which includes both main effects and interactions, has drawn a great deal of attention from researchers in many scientific fields. Researchers have found that it is extremely important to maintain a heredity principle, such as the strong or weak heredity principle, among variables when demanding sparsity in a quadratic model. The heredity principle is preferred because a model that follows this logical structure is invariant to any scale transformation and is more stable in forecasting and classification tasks. Although a large body of work has studied quadratic models, most of it focuses on model performance in regression problems, and no systematic comparison has been made in terms of classification accuracy. This paper investigates group regularized estimation under structural hierarchy for classification (C-GRESH). Computationally, a fast and simple-to-implement algorithm is designed with a theoretical guarantee of its convergence. Furthermore, an accelerated gradient method is applied to speed up the convergence. Theoretically, we show that the adaptive version of the proposed approach achieves the oracle property, which includes asymptotic normality and model selection consistency. Simulation examples and real data examples, including microarray gene expression datasets, demonstrate the efficiency and superior performance of the proposed method over existing competitors. (C) 2018 Elsevier B.V. All rights reserved.
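The abstract names the strong and weak heredity principles without spelling them out. As commonly defined, strong heredity admits an interaction between two variables only when both main effects are in the model, and weak heredity requires at least one of them. The sketch below is a minimal illustration of a quadratic logistic model and of these two checks. It is not the C-GRESH algorithm, and the names `quadratic_logit_prob`, `satisfies_heredity`, `b0`, `b`, and `Phi` are hypothetical notation chosen for this example, not taken from the paper.

```python
import numpy as np


def quadratic_logit_prob(x, b0, b, Phi):
    """P(y = 1 | x) when the logit is b0 + x^T b + x^T Phi x (assumed notation)."""
    eta = b0 + x @ b + x @ Phi @ x
    return 1.0 / (1.0 + np.exp(-eta))


def satisfies_heredity(b, Phi, kind="strong", tol=0.0):
    """Check whether the nonzero pattern of (b, Phi) obeys a heredity rule.

    strong: Phi[j, k] != 0 requires b[j] != 0 and b[k] != 0
    weak:   Phi[j, k] != 0 requires b[j] != 0 or  b[k] != 0
    """
    if kind not in ("strong", "weak"):
        raise ValueError("kind must be 'strong' or 'weak'")
    active = np.abs(b) > tol          # which main effects are in the model
    p = len(b)
    for j in range(p):
        for k in range(j, p):
            # Treat the (j, k) interaction as present if either triangle is nonzero.
            present = abs(Phi[j, k]) > tol or abs(Phi[k, j]) > tol
            if not present:
                continue
            if kind == "strong" and not (active[j] and active[k]):
                return False
            if kind == "weak" and not (active[j] or active[k]):
                return False
    return True


# Illustrative coefficients: main effects on variables 0 and 2 only.
b = np.array([1.0, 0.0, 2.0])

Phi_ok = np.zeros((3, 3))
Phi_ok[0, 2] = 0.5                    # both parents active

Phi_weak_only = np.zeros((3, 3))
Phi_weak_only[0, 1] = 0.5             # only variable 0 is active

print(satisfies_heredity(b, Phi_ok, "strong"))
print(satisfies_heredity(b, Phi_weak_only, "strong"))
print(satisfies_heredity(b, Phi_weak_only, "weak"))
```

A check of this kind only verifies a fitted sparsity pattern after the fact. Per the abstract, the paper instead enforces the hierarchy during estimation through group regularization.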
Pages: 842-857
Number of pages: 16