Linear reduction methods for tag SNP selection

Times cited: 0
Authors
He, JW [1 ]
Zelikovsky, A [1 ]
Affiliation
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
Source
PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7 | 2004 / Vol. 26
Keywords
single nucleotide polymorphism; tag SNP; linear independence
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
It is widely hoped that constructing a complete human haplotype map will help to associate complex diseases with certain SNPs. Unfortunately, the number of SNPs is huge and it is very costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNPs that must be sequenced to a considerably smaller number of informative representatives, so-called tag SNPs. In this paper, we propose a new linear-algebra-based method for selecting and using tag SNPs. Our method is purely combinatorial and can be combined with linkage disequilibrium (LD) and block-based methods. We measure the quality of our tag SNP selection algorithm by comparing actual SNPs with SNPs linearly predicted from linearly chosen tag SNPs. We obtain extremely good compression and prediction rates. For example, for long haplotypes (> 25000 SNPs), knowing only 0.4% of all SNPs, we predict the entire unknown haplotype with 2% accuracy, while the prediction method is based on a 10% sample of the population.
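The abstract gives no algorithmic detail, so the following is only a minimal sketch of one way the idea of "linearly chosen tag SNPs" and "linearly predicted SNPs" could be realized, not the authors' actual procedure. It assumes haplotypes are encoded as 0/1 rows of a matrix (one column per SNP), takes the pivot columns of the reduced row echelon form as tag SNPs, and predicts each remaining SNP as a linear combination of the tags, rounded to 0 or 1. The function names `rref`, `select_tags`, and `predict`, the toy `sample` matrix, and the rounding threshold of 1/2 are all hypothetical choices made for illustration.

```python
from fractions import Fraction


def rref(matrix):
    """Return (reduced rows, pivot column indices) using exact rational arithmetic."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n_cols = len(rows[0]) if rows else 0
    pivots = []
    r = 0  # index of the next pivot row
    for c in range(n_cols):
        if r == len(rows):
            break
        # Find a row at or below r with a nonzero entry in column c.
        pivot_row = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot_row is None:
            continue  # column c depends on earlier pivot columns
        rows[r], rows[pivot_row] = rows[pivot_row], rows[r]
        pivot_value = rows[r][c]
        rows[r] = [x / pivot_value for x in rows[r]]
        # Clear column c in every other row.
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
    return rows, pivots


def select_tags(sample):
    """Pick linearly independent SNP columns (assumed tag SNPs) and the
    coefficients expressing every other column in terms of them."""
    reduced, tags = rref(sample)
    n_cols = len(sample[0])
    # In reduced row echelon form, column j equals
    # sum over k of reduced[k][j] * (column tags[k]).
    coeffs = {
        j: [reduced[k][j] for k in range(len(tags))]
        for j in range(n_cols)
        if j not in tags
    }
    return tags, coeffs


def predict(tag_values, tags, coeffs, n_cols):
    """Reconstruct a full haplotype from its tag SNP values only."""
    haplotype = [None] * n_cols
    for k, t in enumerate(tags):
        haplotype[t] = tag_values[k]
    for j, weights in coeffs.items():
        estimate = sum(w * v for w, v in zip(weights, tag_values))
        # Assumed rounding rule: map the linear estimate back to an allele.
        haplotype[j] = 1 if estimate >= Fraction(1, 2) else 0
    return haplotype


# Toy sample: SNP 2 duplicates SNP 0, and SNP 3 duplicates SNP 1.
sample = [
    [0, 1, 0, 1],
    [1, 0, 1, 0],
    [1, 1, 1, 1],
]
tags, coeffs = select_tags(sample)
reconstructed = predict([1, 0], tags, coeffs, n_cols=4)
```

In this toy case only two of the four SNPs need to be typed for a new individual; the other two are recovered from the linear relations learned on the sample. Real data would contain noise and recombination, so exact linear dependence is an idealization of this sketch.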
Pages: 2840-2843
Number of pages: 4