Single-sample SNP detection by empirical Bayes method using next generation sequencing data

被引:1
作者
Ding, Weijie
Kou, Qiang
Wang, Xueqin [1 ,2 ]
Xu, Qiuya
You, Na [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, South China Res Ctr Stat, Guangzhou 510275, Guangdong, Peoples R China
关键词
Next generation sequencing; Single-sample; Genotyping; SNP detection; Empirical Bayes method; MODEL;
D O I
10.4310/SII.2015.v8.n4.a5
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The rapid development of next generation sequencing technology is changing the way of biological research in many aspects, which has become the most popular platform for the genomic structural variation detection. In this paper, we focus on the single-sample next generation sequencing data analysis, and propose a hierarchical structure to model the dispersion of minor allele frequency in the genome scale. The empirical Bayes method is employed to estimate the hyper-parameters, and the minor allele is identified as a sequencing error or heterozygous allele according to the posterior probabilities. We suggest to leave the ambiguous positions with moderate posterior probabilities ungenotyped for better genotype-call error control. The performances of our proposed method are investigated by simulations and a real dataset.
引用
收藏
页码:457 / 462
页数:6
相关论文
共 14 条
[1]  
Abecasis G.R., 2012, NATURE, V491, P56, DOI DOI 10.1038/nature11632
[2]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[3]  
[Anonymous], 2009, BIOINFORMATICS
[4]  
[Anonymous], ANN APPL STAT
[5]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[6]  
Li H., 2010, Mathematical notes on SAMtools algorithms
[7]   SNP detection for massively parallel whole-genome resequencing [J].
Li, Ruiqiang ;
Li, Yingrui ;
Fang, Xiaodong ;
Yang, Huanming ;
Wang, Jian ;
Kristiansen, Karsten ;
Wang, Jun .
GENOME RESEARCH, 2009, 19 (06) :1124-1132
[8]   SeqEM: an adaptive genotype-calling approach for next-generation sequencing studies [J].
Martin, E. R. ;
Kinnamon, D. D. ;
Schmidt, M. A. ;
Powell, E. H. ;
Zuchner, S. ;
Morris, R. W. .
BIOINFORMATICS, 2010, 26 (22) :2803-2810
[9]   APPLICATIONS OF NEXT-GENERATION SEQUENCING Sequencing technologies - the next generation [J].
Metzker, Michael L. .
NATURE REVIEWS GENETICS, 2010, 11 (01) :31-46
[10]   DETECTING MUTATIONS IN MIXED SAMPLE SEQUENCING DATA USING EMPIRICAL BAYES [J].
Muralidharan, Omkar ;
Natsoulis, Georges ;
Bell, John ;
Ji, Hanlee ;
Zhang, Nancy R. .
ANNALS OF APPLIED STATISTICS, 2012, 6 (03) :1047-1067