compound.Cox: Univariate feature selection and compound covariate for predicting survival

被引:73
作者
Emura, Takeshi [1 ]
Matsui, Shigeyuki [2 ]
Chen, Hsuan-Yu [3 ]
机构
[1] Natl Cent Univ, Grad Inst Stat, Zhongda Rd, Taoyuan 32001, Taiwan
[2] Nagoya Univ, Grad Sch Med, Dept Biostat, Showa Ku, 65 Tsurumai Cho, Nagoya, Aichi 4668550, Japan
[3] Acad Sinica, Inst Stat Sci, 128 Acad Rd Sec 2, Nankang 115, Taiwan
关键词
Cancer prognosis; Copula; Cox regression; Cross-validation; Dependent censoring; False discovery rate; Gene expression; High-dimensional data; Multiple testing; EXPRESSION; DISCRIMINATION; CANCER; TIME;
D O I
10.1016/j.cmpb.2018.10.020
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and objective: Univariate feature selection is one of the simplest and most commonly used techniques to develop a multigene predictor for survival. Presently, there is no software tailored to perform univariate feature selection and predictor construction. Methods: We develop the compound.Cox R package that implements univariate significance tests (via the Wald tests or score tests) for feature selection. We provide a cross-validation algorithm to measure predictive capability of selected genes and a permutation algorithm to assess the false discovery rate. We also provide three algorithms for constructing a multigene predictor (compound covariate, compound shrinkage, and copula-based methods), which are tailored to the subset of genes obtained from univariate feature selection. We demonstrate our package using survival data on the lung cancer patients. We examine the predictive capability of the developed algorithms by the lung cancer data and simulated data. Results: The developed R package, compound.Cox, is available on the CRAN repository. The statistical tools in compound.Cox allow researchers to determine an optimal significance level of the tests, thus providing researchers an optimal subset of genes for prediction. The package also allows researchers to compute the false discovery rate and various prediction algorithms. (C) 2018 Published by Elsevier B.V.
引用
收藏
页码:21 / 37
页数:17
相关论文
共 48 条
[1]   Gene-expression profiles predict survival of patients with lung adenocarcinoma [J].
Beer, DG ;
Kardia, SLR ;
Huang, CC ;
Giordano, TJ ;
Levin, AM ;
Misek, DE ;
Lin, L ;
Chen, GA ;
Gharib, TG ;
Thomas, DG ;
Lizyness, ML ;
Kuick, R ;
Hayasaka, S ;
Taylor, JMG ;
Iannettoni, MD ;
Orringer, MB ;
Hanash, S .
NATURE MEDICINE, 2002, 8 (08) :816-824
[2]   Predicting survival from microarray data -: a comparative study [J].
Bovelstad, H. M. ;
Nygard, S. ;
Storvold, H. L. ;
Aldrin, M. ;
Borgan, O. ;
Frigessi, A. ;
Lingjaerde, O. C. .
BIOINFORMATICS, 2007, 23 (16) :2080-2087
[3]   A five-gene signature and clinical outcome in non-small-cell lung cancer [J].
Chen, Hsuan-Yu ;
Yu, Sung-Liang ;
Chen, Chun-Houh ;
Chang, Gee-Chen ;
Chen, Chih-Yi ;
Yuan, Ang ;
Cheng, Chiou-Ling ;
Wang, Chien-Hsun ;
Terng, Harn-Jing ;
Kao, Shu-Fang ;
Chan, Wing-Kai ;
Li, Han-Ni ;
Liu, Chun-Chi ;
Singh, Sher ;
Chen, Wei J. ;
Chen, Jeremy J. W. ;
Yang, Pan-Chyr .
NEW ENGLAND JOURNAL OF MEDICINE, 2007, 356 (01) :11-20
[4]   Semiparametric marginal regression analysis for dependent competing risks under an assumed copula [J].
Chen, Yi-Hau .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2010, 72 :235-251
[5]  
Creswell Kasey G, 2015, Nicotine Tob Res, V17, P566, DOI 10.1093/ntr/ntu192
[6]   Comparison of discrimination methods for the classification of tumors using gene expression data [J].
Dudoit, S ;
Fridlyand, J ;
Speed, TP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :77-87
[7]  
Emura T., 2018, Analysis of Survival Data with Dependent Censoring, Copula-Based Approaches
[8]  
Emura T., 2018, COMPOUND COX UNIVARI
[9]   Personalized dynamic prediction of death according to tumour progression and high-dimensional genetic factors: Meta-analysis with a joint model [J].
Emura, Takeshi ;
Nakatochi, Masahiro ;
Matsui, Shigeyuki ;
Michimae, Hirofumi ;
Rondeau, Virginie .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2018, 27 (09) :2842-2858
[10]   A joint frailty-copula model between tumour progression and death for meta-analysis [J].
Emura, Takeshi ;
Nakatochi, Masahiro ;
Murotani, Kenta ;
Rondeau, Virginie .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (06) :2649-2666