Recursive partitioning analysis of complex disease pharmacogenetic studies. I. Motivation and overview

被引:10
作者
Young, SS [1 ]
Ge, NX
机构
[1] Natl Inst Stat Sci, Res Triangle Pk, NC 27709 USA
[2] Aventis Pharmaceut, Drug Discovery Biostat, Bridgewater, NJ 08807 USA
关键词
complex disease; pharmacogenctics; recursive partitioning;
D O I
10.1517/14622416.6.1.65
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Identifying genetic variation predictive of important phenotypes, including disease susceptibility, drug efficacy, and adverse events, is a challenging task, and theory and computer science work is being carried out in an attempt to tackle this issue. For many important diseases, such as diabetes, schizophrenia, and depression, the etiology is complex; either the disease is a result of several multiple mechanisms or is caused by an interaction among multiple genes or gene-environment interactions, or both. There is a need for statistical methods to deal with the large, complex data sets that will be used to disentangle these diseases. Each putative genetic polymorphism can be tested for association sequentially. The most difficult problem, however, is the identification of combinations of polymorphisms or genetic markers with increased predictive characteristics. Data from clinical trials, where patients with a particular disease are treated with certain drugs, can be retrospectively assembled using a case-control design. Such data will typically include treatment assignment, demographics, medical history, and genotypes for a large number of genetic markers. The number of variables in such data is expected to be much larger than the number of subjects. This report focuses on some of the methods being employed to deal with this complex data and covers, in some detail, a data-mining method - recursive partitioning - to analyze such data. The methods are demonstrated using a complex simulated data set, as there are few available public data sets. This explication of recursive partitioning should provide researchers with a better idea of the current available analysis techniques, in order to allow them to plan their experiments more effectively.
引用
收藏
页码:65 / 75
页数:11
相关论文
共 36 条
[1]   Predicting disease using genomics [J].
Bell, J .
NATURE, 2004, 429 (6990) :453-456
[2]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[4]   Tree and spline based association analysis of gene-gene interaction models for ischemic stroke [J].
Ccok, NR ;
Zee, RYL ;
Ridker, PM .
STATISTICS IN MEDICINE, 2004, 23 (09) :1439-1453
[5]   Loci on chromosomes 2 (NIDDM1) and 15 interact to increase susceptibility to diabetes in Mexican Americans [J].
Cox, NJ ;
Frigge, M ;
Nicolae, DL ;
Concannon, P ;
Hanis, CL ;
Bell, GI ;
Kong, A .
NATURE GENETICS, 1999, 21 (02) :213-215
[6]  
CURRAN MD, 2003, STAT MODELING GENETI
[7]   Genomic control, a new approach to genetic-based association studies [J].
Devlin, B ;
Roeder, K ;
Wasserman, L .
THEORETICAL POPULATION BIOLOGY, 2001, 60 (03) :155-166
[8]   Inference on haplotype effects in case-control studies using unphased genotype data [J].
Epstein, MP ;
Satten, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1316-1329
[9]   Moving towards individualized medicine with pharmacogenomics [J].
Evans, WE ;
Relling, MV .
NATURE, 2004, 429 (6990) :464-468
[10]   Combining genotype groups and recursive partitioning: an application to human immunodeficiency virus type 1 genetics data [J].
Foulkes, AS ;
De Gruttola, V ;
Hertogs, K .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2004, 53 :311-323