Reliable identification of large numbers of candidate SNPs from public EST data

被引:201
作者
Buetow, KH [1 ]
Edmonson, MN
Cassidy, AB
机构
[1] NCI, Lab Populat Genet, NIH, Bethesda, MD 20892 USA
[2] Fox Chase Canc Ctr, Philadelphia, PA 19111 USA
关键词
D O I
10.1038/6851
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
High-resolution genetic analysis of the human genome promises to provide insight into common disease susceptibility. To perform such analysis will require a collection of high-throughput, high-density analysis reagents. We have developed a polymorphism detection system that uses public-domain sequence data. This detection system is called the single nucleotide polyrmorphism pipeline (SNPpipeline). The analytic core of the SNPpipeline is composed of three components: PHRED, PHRAP and DEMIGLACE. PHRED and PHRAP are components of a sequence analysis suite developed to perform the semi-automated analysis required for large-scale genomes(1,2) (provided courtesy of P. Green). Using these informatics tools, which examine redundant raw expressed sequence tag (EST) data, we have identified more than 3,000 candidate single-nucleotide polyrmorphisms (SNPs). Empiric validation studies of a set of 192 candidates indicate that 82% identify variation in a sample of ten Centre d'Etudes Polymorphism Humain (CEPH) individuals. Our results suggest that existing sequence resources may serve as a valuable source for identifying genetic variation.
引用
收藏
页码:323 / 325
页数:3
相关论文
共 8 条
[1]  
[Anonymous], 1963, PRINCIPLES NUMERICAL
[2]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[3]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185
[4]   Generation and analysis of 280,000 human expressed sequence tags [J].
Hillier, L ;
Lennon, G ;
Becker, M ;
Bonaldo, MF ;
Chiapelli, B ;
Chissoe, S ;
Dietrich, N ;
DuBuque, T ;
Favello, A ;
Gish, W ;
Hawkins, M ;
Hultman, M ;
Kucaba, T ;
Lacy, M ;
Le, M ;
Le, N ;
Mardis, E ;
Moore, B ;
Morris, M ;
Parsons, J ;
Prange, C ;
Rifkin, L ;
Rohlfing, T ;
Schellenberg, K ;
Soares, MB ;
Tan, F ;
ThierryMeg, J ;
Trevaskis, E ;
Underwood, K ;
Wohldman, P ;
Waterston, R ;
Wilson, R ;
Marra, M .
GENOME RESEARCH, 1996, 6 (09) :807-828
[5]  
JIN L, 1990, MOL BIOL EVOL, V7, P82
[6]   A COMPREHENSIVE HUMAN LINKAGE WITH CENTIMORGAN DENSITY [J].
MURRAY, JC ;
BUETOW, KH ;
WEBER, JL ;
LUDWIGSEN, S ;
SCHERPBIERHEDDEMA, T ;
MANION, F ;
QUILLEN, J ;
SHEFFIELD, VC ;
SUNDEN, S ;
DUYK, GM ;
WEISSENBACH, J ;
GYAPAY, G ;
DIB, C ;
MORRISSETTE, J ;
LATHROP, GM ;
VIGNAL, A ;
WHITE, R ;
MATSUNAMI, N ;
GERKEN, S ;
MELIS, R ;
ALBERTSEN, H ;
PLAETKE, R ;
ODELBERG, S ;
WARD, D ;
DAUSSET, J ;
COHEN, D ;
CANN, H .
SCIENCE, 1994, 265 (5181) :2049-2054
[7]   Pieces of the puzzle: expressed sequence tags and the catalog of human genes [J].
Schuler, GD .
JOURNAL OF MOLECULAR MEDICINE-JMM, 1997, 75 (10) :694-698
[8]   Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome [J].
Wang, DG ;
Fan, JB ;
Siao, CJ ;
Berno, A ;
Young, P ;
Sapolsky, R ;
Ghandour, G ;
Perkins, N ;
Winchester, E ;
Spencer, J ;
Kruglyak, L ;
Stein, L ;
Hsie, L ;
Topaloglou, T ;
Hubbell, E ;
Robinson, E ;
Mittmann, M ;
Morris, MS ;
Shen, NP ;
Kilburn, D ;
Rioux, J ;
Nusbaum, C ;
Rozen, S ;
Hudson, TJ ;
Lipshutz, R ;
Chee, M ;
Lander, ES .
SCIENCE, 1998, 280 (5366) :1077-1082