Two-stage penalized algorithms via integrating prior information improve gene selection from omics data

Citations: 3
Authors
Chen, Shunjie [1]
Yang, Sijia [1]
Wang, Pei [1, 2, 3]
Xue, Liugen [1]
Affiliations
[1] Henan Univ, Sch Math & Stat, Kaifeng 475004, Peoples R China
[2] Henan Univ, Henan Engn Res Ctr Ind Internet Things, Zhengzhou 450046, Peoples R China
[3] Henan Univ, Ctr Appl Math Henan Prov, Kaifeng 475004, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Two-stage penalized regression; Prior information; Dimensional reduction; Gene selection; Omics data; ACTIVATED PROTEIN-KINASE; VARIABLE SELECTION; REGULARIZATION; ASSOCIATION; REGRESSION; EXPRESSION; PROGNOSIS; MAPK3/1; MODELS
DOI
10.1016/j.physa.2023.129164
Chinese Library Classification number
O4 [Physics]
Discipline classification code
0702
Abstract
With the rapid development of cancer biology, a considerable number of cancer driver genes have been identified, either experimentally or theoretically, and these can serve as prior information. The growing volume of omics data calls for efficient statistical learning algorithms that incorporate this prior information to further explore various cancers. In this paper, four two-stage algorithms that integrate prior information are developed. The first stage of each algorithm condenses the prior information into representative response variables via principal component analysis (PCA), factor analysis, or weighted group Lasso penalized logistic regression. In the second stage, penalized linear regression models with a Lasso or elastic net penalty are fitted. The performance of the algorithms is examined on both simulated data and 26 real-world cancer datasets. One of the algorithms, called PCALasso, stands out in terms of accuracy and robustness in gene selection. Among the eight algorithms compared, PCALasso yields moderately sparse results, correctly screens all desired variables in the simulated data, and identifies truly informative genes in various cancer datasets, making it a promising algorithm for gene selection from omics data. © 2023 Elsevier B.V. All rights reserved.
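The two-stage idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that the first stage takes the first principal component of the prior (driver) genes as the representative response, and that the second stage fits a Lasso regression of that response on the remaining candidate genes. The function names, the coordinate-descent solver, the penalty value, and the simulated data are all hypothetical choices made for illustration.

```python
import numpy as np

def first_pc_scores(X_prior):
    """Stage 1 (assumed form): scores of the prior genes on their first PC."""
    Xc = X_prior - X_prior.mean(axis=0)
    # Rows of Vt are the principal axes of the centered data.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[0]

def soft_threshold(z, t):
    """Soft-thresholding operator used by the Lasso update."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_cd(X, y, lam, n_iter=200):
    """Lasso by cyclic coordinate descent:
    minimize (1/2n) * ||y - X b||^2 + lam * ||b||_1 on centered data."""
    n, p = X.shape
    X = X - X.mean(axis=0)
    y = y - y.mean()
    col_sq = (X ** 2).sum(axis=0) / n
    beta = np.zeros(p)
    resid = y.copy()
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of column j with the partial residual.
            rho = X[:, j] @ resid / n + col_sq[j] * beta[j]
            new = soft_threshold(rho, lam) / col_sq[j]
            resid += X[:, j] * (beta[j] - new)
            beta[j] = new
    return beta

def pca_lasso_select(X, prior_idx, lam):
    """Stage 2 (assumed form): Lasso of the PC response on non-prior genes."""
    prior = set(prior_idx)
    y = first_pc_scores(X[:, list(prior_idx)])
    candidates = [j for j in range(X.shape[1]) if j not in prior]
    beta = lasso_cd(X[:, candidates], y, lam)
    return [candidates[k] for k in np.flatnonzero(beta)]

# Hypothetical simulated data: one latent factor drives the prior genes
# (columns 0-2) and two informative genes (columns 3 and 4); the rest is noise.
rng = np.random.default_rng(0)
n, p = 200, 20
z = rng.normal(size=n)
X = rng.normal(size=(n, p))
X[:, 0:3] = z[:, None] + 0.1 * rng.normal(size=(n, 3))
X[:, 3] = z + 0.3 * rng.normal(size=n)
X[:, 4] = -z + 0.3 * rng.normal(size=n)

selected = pca_lasso_select(X, prior_idx=[0, 1, 2], lam=0.2)
print("selected gene columns:", selected)
```

In this sketch the penalty `lam` controls sparsity: larger values select fewer genes. In practice it would normally be tuned, for example by cross-validation, rather than fixed by hand.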
Pages: 15