WGRLR: A Weighted Group Regularized Logistic Regression for Cancer Diagnosis and Gene Selection

被引:7
作者
Song, Xuekun [1 ]
Liang, Ke [2 ]
Li, Juntao [2 ]
机构
[1] Henan Univ Chinese Med, Sch Informat Technol, Zhengzhou 450046, Peoples R China
[2] Henan Normal Univ, Sch Math & Informat Sci, Xinxiang 453007, Peoples R China
关键词
Noise reduction; gene grouping; cancer diagnosis; gene selection; VARIABLE SELECTION; ROBUST REGRESSION; GROUP LASSO; EXPRESSION; CLASSIFICATION; PREDICTION; VALIDATION; SHRINKAGE;
D O I
10.1109/TCBB.2022.3203167
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Sparse regressions applied to cancer diagnosis suffer from noise reduction, gene grouping, and group significance evaluation. This paper presented the weighted group regularized logistic regression (WGRLR) for dealing with the above problems. Clean data was separated from noisy gene expression profile data, based on which gene grouping and model building were performed. An interpretable gene group significance evaluation criterion was proposed based on symmetrical uncertainty and module eigengene. A group-wise individual gene significance evaluation criterion was also presented. The performances of the proposed method were compared with WGGL, ASGL-CMI, SGL, GL, Elastic Net, and lasso on acute leukemia and brain cancer data. Experimental results demonstrate that the proposed method is superior to the other six methods in cancer diagnosis accuracy and gene selection.
引用
收藏
页码:1563 / 1573
页数:11
相关论文
共 42 条
[11]   How high is the level of technical noise in microarray data? [J].
Klebanov, Lev ;
Yakovlev, Andrei .
BIOLOGY DIRECT, 2007, 2 (1)
[12]   Robust regression through the Huber's criterion and adaptive lasso penalty [J].
Lambert-Lacroix, Sophie ;
Zwald, Laurent .
ELECTRONIC JOURNAL OF STATISTICS, 2011, 5 :1015-1053
[13]   WGCNA: an R package for weighted correlation network analysis [J].
Langfelder, Peter ;
Horvath, Steve .
BMC BIOINFORMATICS, 2008, 9 (1)
[14]   On the Adversarial Robustness of LASSO Based Feature Selection [J].
Li, Fuwei ;
Lai, Lifeng ;
Cui, Shuguang .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 :5555-5567
[15]   Gene selection of rat hepatocyte proliferation using adaptive sparse group lasso with weighted gene co-expression network analysis [J].
Li, Juntao ;
Wang, Yadi ;
Xiao, Huimin ;
Xu, Cunshuan .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 80 :364-373
[16]   Grouped Gene Selection of Cancer via Adaptive Sparse Group Lasso Based on Conditional Mutual Information [J].
Li, Juntao ;
Dong, Wenpeng ;
Meng, Deyuan .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (06) :2028-2038
[17]   Adaptive multinomial regression with overlapping groups for multi-class classification of lung cancer [J].
Li, Juntao ;
Wang, Yanyan ;
Song, Xuekun ;
Xiao, Huimin .
COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 100 :1-9
[18]   Cancer Diagnosis Through IsomiR Expression with Machine Learning Method [J].
Liao, Zhijun ;
Li, Dapeng ;
Wang, Xinrui ;
Li, Lisheng ;
Zou, Quan .
CURRENT BIOINFORMATICS, 2018, 13 (01) :57-63
[19]   RPCA-Based Tumor Classification Using Gene Expression Data [J].
Liu, Jin-Xing ;
Xu, Yong ;
Zheng, Chun-Hou ;
Kong, Heng ;
Lai, Zhi-Hui .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (04) :964-970
[20]   Kernel based methods for accelerated failure time model with ultra-high dimensional data [J].
Liu, Zhenqiu ;
Chen, Dechang ;
Tan, Ming ;
Jiang, Feng ;
Gartenhaus, Ronald B. .
BMC BIOINFORMATICS, 2010, 11