Regularized logistic regression with adjusted adaptive elastic net for gene selection in high dimensional cancer classification

被引:79
作者
Algamal, Zakariya Yahya [1 ]
Lee, Muhammad Hisyam [1 ]
机构
[1] Univ Teknol Malaysia, Dept Math Sci, Skudai 81310, Johor, Malaysia
关键词
Adaptive elastic net; Oracle property; Regularized logistic regression; Cancer classification; Gene selection; VARIABLE SELECTION; LASSO; PREDICTION; ALGORITHM; PENALTY;
D O I
10.1016/j.compbiomed.2015.10.008
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cancer classification and gene selection in high-dimensional data have been popular research topics in genetics and molecular biology. Recently, adaptive regularized logistic regression using the elastic net regularization, which is called the adaptive elastic net, has been successfully applied in high-dimensional cancer classification to tackle both estimating the gene coefficients and performing gene selection simultaneously. The adaptive elastic net originally used elastic net estimates as the initial weight, however, using this weight may not be preferable for certain reasons: First, the elastic net estimator is biased in selecting genes. Second, it does not perform well when the pairwise correlations between variables are not high. Adjusted adaptive regularized logistic regression (AAElastic) is proposed to address these issues and encourage grouping effects simultaneously. The real data results indicate that AAElastic is significantly consistent in selecting genes compared to the other three competitor regularization methods. Additionally, the classification performance of AAElastic is comparable to the adaptive elastic net and better than other regularization methods. Thus, we can conclude that AAElastic is a reliable adaptive regularized logistic regression method in the field of high-dimensional cancer classification. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:136 / 145
页数:10
相关论文
共 45 条
[1]   Penalized logistic regression with the adaptive LASSO for gene selection in high-dimensional cancer classification [J].
Algamal, Zakariya Yahya ;
Lee, Muhammad Hisyam .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (23) :9326-9332
[2]   Bayesian adaptive Lasso quantile regression [J].
Alhamzawi, Rahim ;
Yu, Keming ;
Benoit, Dries F. .
STATISTICAL MODELLING, 2012, 12 (03) :279-297
[3]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[4]   Regularized logistic regression without a penalty term: An application to cancer classification with microarray data [J].
Bielza, Concha ;
Robles, Victor ;
Larranaga, Pedro .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) :5110-5118
[5]   Correlated variables in regression: Clustering and sparse estimation [J].
Buehlmann, Peter ;
Ruetimann, Philipp ;
van de Geer, Sara ;
Zhang, Cun-Hui .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2013, 143 (11) :1835-1858
[6]   Gene selection in cancer classification using sparse logistic regression with Bayesian regularization [J].
Cawley, Gavin C. ;
Talbot, Nicola L. C. .
BIOINFORMATICS, 2006, 22 (19) :2348-2355
[7]   An efficient statistical feature selection approach for classification of gene expression data [J].
Chandra, B. ;
Gupta, Manish .
JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (04) :529-535
[8]   Adaptive Elastic-Net Sparse Principal Component Analysis for Pathway Association Testing [J].
Chen, Xi .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
[9]   Sparse maximum margin discriminant analysis for feature extraction and gene selection on gene expression data [J].
Cui, Yan ;
Zheng, Chun-Hou ;
Yang, Jian ;
Sha, Wen .
COMPUTERS IN BIOLOGY AND MEDICINE, 2013, 43 (07) :933-941
[10]   Ridge Regression in Prediction Problems: Automatic Choice of the Ridge Parameter [J].
Cule, Erika ;
De Iorio, Maria .
GENETIC EPIDEMIOLOGY, 2013, 37 (07) :704-714