Mutual Information for Testing Gene-Environment Interaction

被引:18
作者
Wu, Xuesen [1 ,2 ,5 ]
Jin, Li [1 ,2 ,3 ]
Xiong, Momiao [1 ,2 ,4 ]
机构
[1] Fudan Univ, Sch Life Sci, Theoret Syst Biol Lab, Shanghai 200433, Peoples R China
[2] Fudan Univ, Ctr Evolutionary Biol, Shanghai 200433, Peoples R China
[3] Chinese Acad Sci, SIBS, Inst Computat Biol, MPG, Shanghai, Peoples R China
[4] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Human Genet Ctr, Houston, TX USA
[5] Bengbu Med Coll Bengbu, Dept Epidemiol & Stat, Anhua, Peoples R China
基金
美国国家卫生研究院;
关键词
MULTIFACTOR-DIMENSIONALITY REDUCTION; RHEUMATOID-ARTHRITIS; POLYMORPHISMS; SMOKING; EPISTASIS; HLA-DRB1; DESIGNS; CANCER;
D O I
10.1371/journal.pone.0004578
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Despite current enthusiasm for investigation of gene-gene interactions and gene-environment interactions, the essential issue of how to define and detect gene-environment interactions remains unresolved. In this report, we define gene-environment interactions as a stochastic dependence in the context of the effects of the genetic and environmental risk factors on the cause of phenotypic variation among individuals. We use mutual information that is widely used in communication and complex system analysis to measure gene-environment interactions. We investigate how gene-environment interactions generate the large difference in the information measure of gene-environment interactions between the general population and a diseased population, which motives us to develop mutual information-based statistics for testing gene-environment interactions. We validated the null distribution and calculated the type 1 error rates for the mutual information-based statistics to test gene-environment interactions using extensive simulation studies. We found that the new test statistics were more powerful than the traditional logistic regression under several disease models. Finally, in order to further evaluate the performance of our new method, we applied the mutual information-based statistics to three real examples. Our results showed that P-values for the mutual information-based statistics were much smaller than that obtained by other approaches including logistic regression models.
引用
收藏
页数:12
相关论文
共 39 条
[1]   Interaction: A word with two meanings creates confusion [J].
Ahlbom, A ;
Alfredsson, L .
EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2005, 20 (07) :563-564
[2]   Epidemiologic and genetic approaches in the study of gene-environment interaction: an overview of available methods [J].
Andrieu, N ;
Goldstein, AM .
EPIDEMIOLOGIC REVIEWS, 1998, 20 (02) :137-147
[3]  
[Anonymous], 1991, ELEMENTS INFORM THEO
[4]  
[Anonymous], PSYCHOMETRIKA
[5]   Locality of global stochastic interaction in directed acyclic networks [J].
Ay, N .
NEURAL COMPUTATION, 2002, 14 (12) :2959-2980
[6]  
Brillinger D. R., 2004, BRAZ J PROBAB STAT, V18, P163
[7]   Parallel multifactor dimensionality reduction: a tool for the large-scale analysis of gene-gene interactions [J].
Bush, William S. ;
Dudek, Scott M. ;
Ritchie, Marylyn D. .
BIOINFORMATICS, 2006, 22 (17) :2173-2174
[8]   Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions [J].
Chatterjee, Nilanjan ;
Kalaylioglu, Zeynep ;
Moslehi, Roxana ;
Peters, Ulrike ;
Wacholder, Sholom .
AMERICAN JOURNAL OF HUMAN GENETICS, 2006, 79 (06) :1002-1016
[9]  
CHEVERUD JM, 1995, GENETICS, V139, P1455
[10]   Odds ratio based multifactor-dimensionality reduction method for detecting gene-gene interactions [J].
Chung, Yujin ;
Lee, Seung Yeoun ;
Elston, Robert C. ;
Park, Taesung .
BIOINFORMATICS, 2007, 23 (01) :71-76