WGRLR: A Weighted Group Regularized Logistic Regression for Cancer Diagnosis and Gene Selection

被引:7
作者
Song, Xuekun [1 ]
Liang, Ke [2 ]
Li, Juntao [2 ]
机构
[1] Henan Univ Chinese Med, Sch Informat Technol, Zhengzhou 450046, Peoples R China
[2] Henan Normal Univ, Sch Math & Informat Sci, Xinxiang 453007, Peoples R China
关键词
Noise reduction; gene grouping; cancer diagnosis; gene selection; VARIABLE SELECTION; ROBUST REGRESSION; GROUP LASSO; EXPRESSION; CLASSIFICATION; PREDICTION; VALIDATION; SHRINKAGE;
D O I
10.1109/TCBB.2022.3203167
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Sparse regressions applied to cancer diagnosis suffer from noise reduction, gene grouping, and group significance evaluation. This paper presented the weighted group regularized logistic regression (WGRLR) for dealing with the above problems. Clean data was separated from noisy gene expression profile data, based on which gene grouping and model building were performed. An interpretable gene group significance evaluation criterion was proposed based on symmetrical uncertainty and module eigengene. A group-wise individual gene significance evaluation criterion was also presented. The performances of the proposed method were compared with WGGL, ASGL-CMI, SGL, GL, Elastic Net, and lasso on acute leukemia and brain cancer data. Experimental results demonstrate that the proposed method is superior to the other six methods in cancer diagnosis accuracy and gene selection.
引用
收藏
页码:1563 / 1573
页数:11
相关论文
共 42 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]   Exploring the new world of the genome with DNA microarrays [J].
Brown, PO ;
Botstein, D .
NATURE GENETICS, 1999, 21 (Suppl 1) :33-37
[3]   Robust Principal Component Analysis? [J].
Candes, Emmanuel J. ;
Li, Xiaodong ;
Ma, Yi ;
Wright, John .
JOURNAL OF THE ACM, 2011, 58 (03)
[4]   Gene selection in cancer classification using sparse logistic regression with Bayesian regularization [J].
Cawley, Gavin C. ;
Talbot, Nicola L. C. .
BIOINFORMATICS, 2006, 22 (19) :2348-2355
[5]   Identifying Methylation Pattern and Genes Associated with Breast Cancer Subtypes [J].
Chen, Lei ;
Zeng, Tao ;
Pan, Xiaoyong ;
Zhang, Yu-Hang ;
Huang, Tao ;
Cai, Yu-Dong .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (17)
[6]   Bi-level variable selection via adaptive sparse group Lasso [J].
Fang, Kuangnan ;
Wang, Xiaoyan ;
Zhang, Shengwei ;
Zhu, Jianping ;
Ma, Shuangge .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (13) :2750-2760
[7]   Support vector machine classification and validation of cancer tissue samples using microarray expression data [J].
Furey, TS ;
Cristianini, N ;
Duffy, N ;
Bednarski, DW ;
Schummer, M ;
Haussler, D .
BIOINFORMATICS, 2000, 16 (10) :906-914
[8]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[9]   A Faster cDNA Microarray Gene Expression Data Classifier for Diagnosing Diseases [J].
Hsieh, Sun-Yuan ;
Chou, Yu-Chun .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (01) :43-54
[10]   A Selective Review of Group Selection in High-Dimensional Models [J].
Huang, Jian ;
Breheny, Patrick ;
Ma, Shuangge .
STATISTICAL SCIENCE, 2012, 27 (04) :481-499