Gene-environment interaction analysis under the Cox model

被引:0
作者
Fang, Kuangnan [1 ]
Li, Jingmao [1 ]
Xu, Yaqing [2 ]
Ma, Shuangge [3 ]
Zhang, Qingzhao [1 ,4 ]
机构
[1] Xiamen Univ, Sch Econ, Dept Stat & Data Sci, 422, Siming South Rd, Xiamen 361005, Fujian, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Med, Sch Publ Hlth, 227 South Chongqing Rd, Shanghai 200240, Peoples R China
[3] Yale Sch Publ Hlth, Dept Biostat, 60 Coll St, New Haven, CT 06520 USA
[4] Xiamen Univ, Wang Yanan Inst Studies Econ, 422 Siming South Rd, Xiamen 361005, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Gene-environment interaction analysis; Cox model; Penalized estimation; Asymptotic consistency; VARIABLE SELECTION; WIDE ASSOCIATION; LASSO;
D O I
10.1007/s10463-023-00871-9
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
For the survival of cancer and many other complex diseases, gene-environment (G-E) interactions have been established as having essential importance. G-E interaction analysis can be roughly classified as marginal and joint, depending on the number of G variables analyzed at a time. In this study, we focus on joint analysis, which can better reflect disease biology and is statistically more challenging. Many approaches have been developed for joint G-E interaction analysis for survival outcomes and led to important findings. However, without rigorous statistical development, quite a few methods have a weak theoretical ground. To fill this knowledge gap, in this article, we consider joint G-E interaction analysis under the Cox model. Sparse group penalization is adopted for regularizing estimation and selecting important main effects and interactions. The "main effects, interactions" variable selection hierarchy, which has been strongly advocated in recent literature, is satisfied. Significantly advancing from some published studies, we rigorously establish the consistency properties under high dimensionality. An effective computational algorithm is developed, simulation demonstrates competitive performance of the proposed approach, and analysis of The Cancer Genome Atlas (TCGA) data on stomach adenocarcinoma (STAD) further demonstrates its practical utility.
引用
收藏
页码:931 / 948
页数:18
相关论文
共 32 条
[1]   COX REGRESSION-MODEL FOR COUNTING-PROCESSES - A LARGE SAMPLE STUDY [J].
ANDERSEN, PK ;
GILL, RD .
ANNALS OF STATISTICS, 1982, 10 (04) :1100-1120
[2]   A LASSO FOR HIERARCHICAL INTERACTIONS [J].
Bien, Jacob ;
Taylor, Jonathan ;
Tibshirani, Robert .
ANNALS OF STATISTICS, 2013, 41 (03) :1111-1141
[3]  
Boyd S., 2011, Distributed Optimization and Statistical Learning Via the Alternating Direction Method of Multipliers, DOI DOI 10.1561/2200000016
[4]   REGULARIZATION FOR COX'S PROPORTIONAL HAZARDS MODEL WITH NP-DIMENSIONALITY [J].
Bradic, Jelena ;
Fan, Jianqing ;
Jiang, Jiancheng .
ANNALS OF STATISTICS, 2011, 39 (06) :3092-3120
[5]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[6]   Variable Selection With the Strong Heredity Constraint and Its Oracle Property [J].
Choi, Nam Hee ;
Li, William ;
Zhu, Ji .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (489) :354-364
[7]   Large sample results for frequentist multiple imputation for Cox regression with missing covariate data [J].
Eriksson, Frank ;
Martinussen, Torben ;
Nielsen, Soren Feodor .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2020, 72 (04) :969-996
[8]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[9]   Variable selection for functional linear models with strong heredity constraint [J].
Feng, Sanying ;
Zhang, Menghan ;
Tong, Tiejun .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2022, 74 (02) :321-339
[10]  
Fleming TR., 2005, COUNTING PROCESSES S