Gene Selection in Cancer Classification Using Sparse Logistic Regression with L1/2 Regularization

Cited by: 17
Authors
Wu, Shengbing [1]
Jiang, Hongkun [1]
Shen, Haiwei [1]
Yang, Ziyi [1]
Affiliation
[1] Macau Univ Sci & Technol, Fac Informat Technol, Macau 999078, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2018 / Vol. 8 / Issue 09
Keywords
gene selection; cancer classification; regularized logistic regression; L-1/2 regularization; CELL LUNG-CANCER; VARIABLE SELECTION; EGFR MUTATION; LASSO
DOI
10.3390/app8091569
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
In recent years, gene selection for cancer classification based on the expression of a small number of gene biomarkers has been the subject of much research in genetics and molecular biology. The successful identification of gene biomarkers helps in classifying different types of cancer and improves prediction accuracy. Recently, regularized logistic regression with L-1 regularization has been successfully applied to high-dimensional cancer classification, where it estimates gene coefficients and performs gene selection simultaneously. However, L-1 regularization yields biased gene selection and does not have the oracle property. To address these problems, we investigate L-1/2 regularized logistic regression for gene selection in cancer classification. Experimental results on three DNA microarray datasets demonstrate that the proposed method outperforms other commonly used sparse methods (L-1 and L-EN) in terms of classification performance.
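The abstract compares three penalties (L-1, L-EN and L-1/2) added to a logistic regression loss. As a rough illustration only, the sketch below writes each penalty as a NumPy function and combines it with a mean negative log-likelihood. The function names, the omission of an intercept, and the elastic-net mixing form (weight `alpha` on the L-1 term, `(1 - alpha) / 2` on the squared term) are assumptions made for this example and are not taken from the paper; it evaluates the objective and does not reproduce the authors' solver.

```python
import numpy as np

def logistic_nll(beta, X, y):
    """Mean negative log-likelihood of a logistic model without intercept; y in {0, 1}."""
    z = X @ beta
    # log(1 + exp(z)) evaluated stably with logaddexp
    return np.mean(np.logaddexp(0.0, z) - y * z)

def penalty_l1(beta):
    """L-1 (lasso) penalty: sum of absolute coefficients."""
    return np.sum(np.abs(beta))

def penalty_l_half(beta):
    """L-1/2 penalty: sum of square roots of absolute coefficients."""
    return np.sum(np.sqrt(np.abs(beta)))

def penalty_elastic_net(beta, alpha=0.5):
    """Elastic-net penalty (assumed mixing form): alpha * L1 + (1 - alpha) / 2 * squared L2."""
    return alpha * np.sum(np.abs(beta)) + (1.0 - alpha) * 0.5 * np.sum(beta ** 2)

def penalized_objective(beta, X, y, lam, penalty):
    """Loss plus lam times the chosen penalty; lam is a hypothetical tuning parameter."""
    return logistic_nll(beta, X, y) + lam * penalty(beta)

# Toy data: 20 samples, 5 "genes"; only the first one carries signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 5))
y = (X[:, 0] > 0).astype(float)
beta = np.array([2.0, 0.0, 0.0, 0.01, 0.0])

for name, pen in [("L1", penalty_l1), ("L1/2", penalty_l_half), ("EN", penalty_elastic_net)]:
    print(name, penalized_objective(beta, X, y, lam=0.1, penalty=pen))
```

Relative to L-1, the square-root penalty charges small nonzero coefficients more and large coefficients less, which is the usual intuition for why it can give sparser and less biased estimates.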
Pages: 12