Weighted General Group Lasso for Gene Selection in Cancer Classification

被引:46
作者
Wang, Yadi [1 ,2 ]
Li, Xiaoping [1 ,2 ]
Ruiz, Ruben [3 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 211189, Jiangsu, Peoples R China
[2] Southeast Univ, Key Lab Comp Network & Informat Integrat, Minist Educ, Nanjing 211189, Jiangsu, Peoples R China
[3] Univ Politecn Valencia, Grp Sistemas Optimizac Aplicada, Inst Tecnol Informat, Ciudad Politecn Innovat, Valencia 46021, Spain
基金
中国国家自然科学基金;
关键词
Cancer classification; gene selection; group lasso; heuristic; joint mutual information; SUPPORT VECTOR MACHINES; LOGISTIC-REGRESSION; ADAPTIVE LASSO; PREDICTION; ALGORITHMS; DISCOVERY;
D O I
10.1109/TCYB.2018.2829811
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Relevant gene selection is crucial for analyzing cancer gene expression datasets including two types of tumors in cancer classification. Intrinsic interactions among selected genes cannot be fully identified by most existing gene selection methods. In this paper, we propose a weighted general group lasso (WGGL) model to select cancer genes in groups. A gene grouping heuristic method is presented based on weighted gene co-expression network analysis. To determine the importance of genes and groups, a method for calculating gene and group weights is presented in terms of joint mutual information. To implement the complex calculation process of WGGL, a gene selection algorithm is developed. Experimental results on both random and three cancer gene expression datasets demonstrate that the proposed model achieves better classification performance than two existing state-of-the-art gene selection methods.
引用
收藏
页码:2860 / 2873
页数:14
相关论文
共 47 条
  • [1] Penalized logistic regression with the adaptive LASSO for gene selection in high-dimensional cancer classification
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (23) : 9326 - 9332
  • [2] [Anonymous], 2006, J ROYAL STAT SOC B
  • [3] Weighted LAD-LASSO method for robust parameter estimation and variable selection in regression
    Arslan, Olcay
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) : 1952 - 1965
  • [4] Reverse engineering of regulatory networks in human B cells
    Basso, K
    Margolin, AA
    Stolovitzky, G
    Klein, U
    Dalla-Favera, R
    Califano, A
    [J]. NATURE GENETICS, 2005, 37 (04) : 382 - 390
  • [5] Feature selection using Joint Mutual Information Maximisation
    Bennasar, Mohamed
    Hicks, Yulia
    Setchi, Rossitza
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) : 8520 - 8532
  • [6] Brown G, 2012, J MACH LEARN RES, V13, P27
  • [7] Gene selection in cancer classification using sparse logistic regression with Bayesian regularization
    Cawley, Gavin C.
    Talbot, Nicola L. C.
    [J]. BIOINFORMATICS, 2006, 22 (19) : 2348 - 2355
  • [8] Cover TM., 1991, Elements of information theory
  • [9] Bi-level variable selection via adaptive sparse group Lasso
    Fang, Kuangnan
    Wang, Xiaoyan
    Zhang, Shengwei
    Zhu, Jianping
    Ma, Shuangge
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (13) : 2750 - 2760
  • [10] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537