Applying genome-wide gene-based expression quantitative trait locus mapping to study population ancestry and pharmacogenetics

被引:7
作者
Yang, Hsin-Chou [1 ,2 ]
Lin, Chien-Wei [1 ]
Chen, Chia-Wei [1 ]
Chen, James J. [3 ]
机构
[1] Acad Sinica, Inst Stat Sci, Taipei 11529, Taiwan
[2] Natl Def Med Ctr, Sch Publ Hlth, Taipei, Taiwan
[3] US FDA, Natl Ctr Toxicol Res, Little Rock, AR USA
关键词
Gene-based approach; Expression quantitative trait locus (eQTL); Partial least squares (PLS); Ancestry-informative marker (AIM); Pharmacogenetics; Adverse drug reaction; Drug response; Drug biotransformation; PARTIAL LEAST-SQUARES; DETERMINING CONTINENTAL ORIGIN; CORONARY-HEART-DISEASE; INFORMATIVE MARKERS; ETHNIC-DIFFERENCES; DRUG DISPOSITION; ASSOCIATION TEST; POLYMORPHISM; NUCLEOTIDE; IDENTIFICATION;
D O I
10.1186/1471-2164-15-319
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Gene-based analysis has become popular in genomic research because of its appealing biological and statistical properties compared with those of a single-locus analysis. However, only a few, if any, studies have discussed a mapping of expression quantitative trait loci (eQTL) in a gene-based framework. Neither study has discussed ancestry-informative eQTL nor investigated their roles in pharmacogenetics by integrating single nucleotide polymorphism (SNP)-based eQTL (s-eQTL) and gene-based eQTL (g-eQTL). Results: In this g-eQTL mapping study, the transcript expression levels of genes (transcript-level genes; T-genes) were correlated with the SNPs of genes (sequence-level genes; S-genes) by using a method of gene-based partial least squares (PLS). Ancestry-informative transcripts were identified using a rank-score-based multivariate association test, and ancestry-informative eQTL were identified using Fisher's exact test. Furthermore, key ancestry-predictive eQTL were selected in a flexible discriminant analysis. We analyzed SNPs and gene expression of 210 independent people of African-, Asian- and European-descent. We identified numerous cis- and trans-acting g-eQTL and s-eQTL for each population by using PLS. We observed ancestry information enriched in eQTL. Furthermore, we identified 2 ancestry-informative eQTL associated with adverse drug reactions and/or drug response. Rs1045642, located on MDR1, is an ancestry-informative eQTL (P = 2.13E-13, using Fisher's exact test) associated with adverse drug reactions to amitriptyline and nortriptyline and drug responses to morphine. Rs20455, located in KIF6, is an ancestry-informative eQTL (P = 2.76E-23, using Fisher's exact test) associated with the response to statin drugs (e.g., pravastatin and atorvastatin). The ancestry-informative eQTL of drug biotransformation genes were also observed; cross-population cis- acting expression regulators included SPG7, TAP2, SLC7A7, and CYP4F2. Finally, we also identified key ancestry-predictive eQTL and established classification models with promising training and testing accuracies in separating samples from close populations. Conclusions: In summary, we developed a gene-based PLS procedure and a SAS macro for identifying g-eQTL and s-eQTL. We established data archives of eQTL for global populations. The program and data archives are accessible at http://www.stat.sinica.edu.tw/hsinchou/genetics/eQTL/HapMapII.htm. Finally, the results from our investigations regarding the interrelationship between eQTL, ancestry information, and pharmacodynamics provide rich resources for future eQTL studies and practical applications in population genetics and medical genetics.
引用
收藏
页数:17
相关论文
共 118 条
[21]   Mapping determinants of human gene expression by regional and genome-wide association [J].
Cheung, VG ;
Spielman, RS ;
Ewens, KG ;
Weber, TM ;
Morley, M ;
Burdick, JT .
NATURE, 2005, 437 (7063) :1365-1369
[22]   The genetics of variation in gene expression [J].
Cheung, VG ;
Spielman, RS .
NATURE GENETICS, 2002, 32 (Suppl 4) :522-525
[23]   Natural variation in human gene expression assessed in lymphoblastoid cells [J].
Cheung, VG ;
Conlin, LK ;
Weber, TM ;
Arcaro, M ;
Jen, KY ;
Morley, M ;
Spielman, RS .
NATURE GENETICS, 2003, 33 (03) :422-425
[24]   Identification of Association Between Disease and Multiple Markers Via Sparse Partial Least-Squares Regression [J].
Chun, Hyonho ;
Ballard, David H. ;
Cho, Judy ;
Zhao, Hongyu .
GENETIC EPIDEMIOLOGY, 2011, 35 (06) :479-486
[25]   A marker for Stevens-Johnson syndrome [J].
Chung, WH ;
Hung, SI ;
Hong, HS ;
Hsih, MS ;
Yang, LC ;
Ho, HC ;
Wu, JY ;
Chen, YT .
NATURE, 2004, 428 (6982) :486-486
[26]   Mapping complex disease traits with global gene expression [J].
Cookson, William ;
Liang, Liming ;
Abecasis, Goncalo ;
Moffatt, Miriam ;
Lathrop, Mark .
NATURE REVIEWS GENETICS, 2009, 10 (03) :184-194
[27]  
Cooper R, 2000, CIRCULATION, V102, P3137
[28]   Mining gene expression databases for association rules [J].
Creighton, C ;
Hanash, S .
BIOINFORMATICS, 2003, 19 (01) :79-86
[29]   Genome-wide association studies in pharmacogenomics [J].
Daly, Ann K. .
NATURE REVIEWS GENETICS, 2010, 11 (04) :241-246
[30]   High-Dimensional Heteroscedastic Regression with an Application to eQTL Data Analysis [J].
Daye, Z. John ;
Chen, Jinbo ;
Li, Hongzhe .
BIOMETRICS, 2012, 68 (01) :316-326