Lung Cancer Classification and Gene Selection by Combining Affinity Propagation Clustering and Sparse Group Lasso

被引:8
作者
Li, Juntao [1 ]
Chang, Mingming [1 ]
Gao, Qinghui [1 ]
Song, Xuekun [2 ]
Gao, Zhiyu [2 ]
机构
[1] Henan Normal Univ, Coll Math & Informat Sci, Xinxiang 453007, Henan, Peoples R China
[2] Henan Univ Chinese Med, Sch Informat Technol, Zhengzhou 450046, Peoples R China
关键词
Lung cancer; gene selection; affinity propagation clustering; sparse group lasso; multi-classification; miRNA; MOLECULAR CLASSIFICATION; CLASS DISCOVERY; EXPRESSION; REGRESSION; REGULARIZATION; PREDICTION; MACHINE;
D O I
10.2174/1574893614666191017103557
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Cancer threatens human health seriously. Diagnosing cancer via gene expression analysis is a hot topic in cancer research. Objective: The study aimed to diagnose the accurate type of lung cancer and discover the pathogenic genes. Methods: In this study, Affinity Propagation (AP) clustering with similarity score was employed to each type of lung cancer and normal lung. After grouping genes, sparse group lasso was adopted to construct four binary classifiers and the voting strategy was used to integrate them. Results: This study screened six gene groups that may associate with different lung cancer sub-types among 73 genes groups, and identified three possible key pathogenic genes, KRAS, BRAF and VDR. Furthermore, this study achieved improved classification accuracies at minority classes SQ and COID in comparison with other four methods. Conclusion: We propose the AP clustering based sparse group lasso (AP-SGL), which provides an alternative for simultaneous diagnosis and gene selection for lung cancer.
引用
收藏
页码:703 / 712
页数:10
相关论文
共 40 条
[1]   Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection [J].
Ang, Jun Chin ;
Mirzal, Andri ;
Haron, Habibollah ;
Hamed, Haza Nuzly Abdull .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (05) :971-989
[2]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[3]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[4]   APCluster: an R package for affinity propagation clustering [J].
Bodenhofer, Ulrich ;
Kothmeier, Andreas ;
Hochreiter, Sepp .
BIOINFORMATICS, 2011, 27 (17) :2463-2464
[5]   Combining affinity propagation clustering and mutual information network to investigate key genes in fibroid [J].
Chen, Qian-Song ;
Wang, Dan ;
Liu, Bao-Lian ;
Ga, Shu-Feng ;
Gao, Dan-Li ;
Li, Gui-Rong .
EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2017, 14 (01) :251-259
[6]   Recursive l1,∞ Group Lasso [J].
Chen, Yilun ;
Hero, Alfred O. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (08) :3978-3987
[7]   Identification and Analysis of Cancer Diagnosis Using Probabilistic Classification Vector Machines with Feature Selection [J].
Du, Xiuquan ;
Li, Xinrui ;
Li, Wen ;
Yan, Yuanting ;
Zhang, Yanping .
CURRENT BIOINFORMATICS, 2018, 13 (06) :625-632
[8]   Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012 [J].
Ferlay, Jacques ;
Soerjomataram, Isabelle ;
Dikshit, Rajesh ;
Eser, Sultan ;
Mathers, Colin ;
Rebelo, Marise ;
Parkin, Donald Maxwell ;
Forman, David ;
Bray, Freddie .
INTERNATIONAL JOURNAL OF CANCER, 2015, 136 (05) :E359-E386
[9]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[10]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22