Discovering disease genes: Multipoint linkage analysis via a new Markov chain Monte Carlo approach

被引:24
作者
George, AW
Thompson, EA
机构
[1] Univ Iowa, Dept Biostat, Iowa City, IA 52242 USA
[2] Univ Washington, Dept Stat, Seattle, WA 98195 USA
关键词
linkage analysis; joint Gibbs updates; integrated proposals; Metropolis-Hastings restarts; sequential imputation;
D O I
10.1214/ss/1081443233
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multipoint linkage analyses of data collected on related individuals are often performed as a first step in the discovery of disease genes. Through the dependence in inheritance of genes segregating at several linked loci, multipoint linkage analysis detects and localizes chromosomal regions (called trait loci) which contain disease genes. Our ability to correctly detect and position these trait loci is increased with the analysis of data observed on large pedigrees and multiple genetic markers. However, large pedigrees generally contain substantial missing data and exact calculation of the required multipoint likelihoods quickly becomes intractable. In this paper, we present a new Markov chain Monte Carlo approach to multipoint linkage analysis which greatly extends the range of models and data sets for which analysis is practical. Several advances in Markov chain Monte Carlo theory, namely joint updates of latent variables across loci or meioses, integrated proposals, Metropolis-Hastings restarts via sequential imputation and Rao-Blackwellized estimators, are incorporated into a sampling strategy which mixes well and produces accurate results in real time. The methodology is demonstrated through its application to several data sets originating from a study of early-onset Alzheimer's disease in families of Volga-German ethnic origin.
引用
收藏
页码:515 / 531
页数:17
相关论文
共 41 条
[1]  
Baum L.E., 1972, Inequalities III: Proceedings of the Third Symposium on Inequalities, page, V3, P1
[2]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[3]   BAYESIAN COMPUTATION AND STOCHASTIC-SYSTEMS [J].
BESAG, J ;
GREEN, P ;
HIGDON, D ;
MENGERSEN, K .
STATISTICAL SCIENCE, 1995, 10 (01) :3-41
[4]   PROBABILITY FUNCTIONS ON COMPLEX PEDIGREES [J].
CANNINGS, C ;
THOMPSON, EA ;
SKOLNICK, MH .
ADVANCES IN APPLIED PROBABILITY, 1978, 10 (01) :26-61
[5]   EFFECTS OF MIS-SPECIFYING GENETIC-PARAMETERS IN LOD SCORE ANALYSIS [J].
CLERGETDARPOUX, F ;
BONAITIPELLIE, C ;
HOCHEZ, J .
BIOMETRICS, 1986, 42 (02) :393-399
[6]   Multipoint oligogenic analysis of age-at-onset data with applications to Alzheimer disease pedigrees [J].
Daw, EW ;
Heath, SC ;
Wijsman, EM .
AMERICAN JOURNAL OF HUMAN GENETICS, 1999, 64 (03) :839-851
[7]   GENERAL MODEL FOR GENETIC ANALYSIS OF PEDIGREE DATA [J].
ELSTON, RC ;
STEWART, J .
HUMAN HEREDITY, 1971, 21 (06) :523-&
[8]   SAMPLING-BASED APPROACHES TO CALCULATING MARGINAL DENSITIES [J].
GELFAND, AE ;
SMITH, AFM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (410) :398-409
[9]   ANNEALING MARKOV-CHAIN MONTE-CARLO WITH APPLICATIONS TO ANCESTRAL INFERENCE [J].
GEYER, CJ ;
THOMPSON, EA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (431) :909-920
[10]   MONTE-CARLO ESTIMATION OF MIXED MODELS FOR LARGE COMPLEX PEDIGREES [J].
GUO, SW ;
THOMPSON, EA .
BIOMETRICS, 1994, 50 (02) :417-432