Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

被引:40
作者
Chang, Chun-Tien [2 ]
Tsai, Chi-Neu [3 ]
Tang, Chuan Yi [2 ]
Chen, Chun-Houh [4 ]
Lian, Jang-Hau [5 ]
Hu, Chi-Yu [5 ]
Tsai, Chia-Lung [1 ,6 ]
Chao, Angel [1 ]
Lai, Chyong-Huey [1 ]
Wang, Tzu-Hao [1 ,6 ]
Lee, Yun-Shien [5 ,6 ]
机构
[1] Chang Gung Univ, Chang Gung Mem Hosp, Lin Kou Med Ctr, Dept Obstet & Gynecol, Tao Yuan 333, Taiwan
[2] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[3] Chang Gung Univ, Grad Inst Clin Med Sci, Tao Yuan 333, Taiwan
[4] Acad Sinica, Inst Stat Sci, Taipei 11529, Taiwan
[5] Ming Chuan Univ, Dept Biotechnol, Tao Yuan, Taiwan
[6] Chang Gung Mem Hosp, Genom Med Res Core Lab, Tao Yuan 333, Taiwan
关键词
COPY NUMBER VARIATION; EXPRESSION; GENETICS; CANCER; RISK;
D O I
10.1100/2012/365104
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as beta-defensin 4 (DEFB4) and its paralog HSPDP3.
引用
收藏
页数:10
相关论文
共 39 条
[1]   Copy number polymorphism and expression level variation of the human α-defensin genes DEFA1 and DEFA3 [J].
Aldred, PMR ;
Hollox, EJ ;
Armour, JAL .
HUMAN MOLECULAR GENETICS, 2005, 14 (14) :2045-2052
[2]   Accurate, high-throughput typing of copy number variation using paralogue ratios from dispersed repeats [J].
Armour, John A. L. ;
Palla, Raquel ;
Zeeuwen, Patrick L. J. M. ;
den Heijer, Martin ;
Schalkwijk, Joost ;
Hollox, Edward J. .
NUCLEIC ACIDS RESEARCH, 2007, 35 (03)
[3]   STR primer concordance study [J].
Budowle, B ;
Masibay, A ;
Anderson, SJ ;
Barna, C ;
Biega, L ;
Brenneke, S ;
Brown, BL ;
Cramer, J ;
DeGroot, GA ;
Douglas, D ;
Duceman, B ;
Eastman, A ;
Giles, R ;
Hamill, J ;
Haase, DJ ;
Janssen, DW ;
Kupferschmid, TD ;
Lawton, T ;
Lemire, C ;
Llewellyn, B ;
Moretti, T ;
Neves, J ;
Palaski, C ;
Schueler, S ;
Sgueglia, J ;
Sprecher, C ;
Tomsey, C ;
Yet, D .
FORENSIC SCIENCE INTERNATIONAL, 2001, 124 (01) :47-54
[4]   Genetics and genomics of core short tandem repeat loci used in human identity testing [J].
Butler, JM .
JOURNAL OF FORENSIC SCIENCES, 2006, 51 (02) :253-265
[5]   Forensic DNA typing by capillary electrophoresis using the ABI Prism 310 and 3100 genetic analyzers for STR analysis [J].
Butler, JM ;
Buel, E ;
Crivellente, F ;
McCord, BR .
ELECTROPHORESIS, 2004, 25 (10-11) :1397-1412
[6]  
Butler JM., 2007, BIOTECHNIQUES, V43, P2, DOI DOI 10.2144/0001
[7]   PolyScan: An automatic indel and SNP detection approach to the analysis of human resequencing data [J].
Chen, Ken ;
McLellan, Michael D. ;
Ding, Li ;
Wendl, Michael C. ;
Kasai, Yumi ;
Wilson, Richard K. ;
Mardis, Elaine R. .
GENOME RESEARCH, 2007, 17 (05) :659-666
[8]   Origins and functional impact of copy number variation in the human genome [J].
Conrad, Donald F. ;
Pinto, Dalila ;
Redon, Richard ;
Feuk, Lars ;
Gokcumen, Omer ;
Zhang, Yujun ;
Aerts, Jan ;
Andrews, T. Daniel ;
Barnes, Chris ;
Campbell, Peter ;
Fitzgerald, Tomas ;
Hu, Min ;
Ihm, Chun Hwa ;
Kristiansson, Kati ;
MacArthur, Daniel G. ;
MacDonald, Jeffrey R. ;
Onyiah, Ifejinelo ;
Pang, Andy Wing Chun ;
Robson, Sam ;
Stirrups, Kathy ;
Valsesia, Armand ;
Walter, Klaudia ;
Wei, John ;
Tyler-Smith, Chris ;
Carter, Nigel P. ;
Lee, Charles ;
Scherer, Stephen W. ;
Hurles, Matthew E. .
NATURE, 2010, 464 (7289) :704-712
[9]   Copy-number variations associated with neuropsychiatric conditions [J].
Cook, Edwin H., Jr. ;
Scherer, Stephen W. .
NATURE, 2008, 455 (7215) :919-923
[10]   Validation of the AMPFlSTR® SGM Plus™ system for use in forensic casework [J].
Cotton, EA ;
Allsop, RF ;
Guest, JL ;
Frazier, RRE ;
Koumi, P ;
Callow, IP ;
Seager, A ;
Sparkes, RL .
FORENSIC SCIENCE INTERNATIONAL, 2000, 112 (2-3) :151-161