Experimental analysis of oligonucleotide microarray design criteria to detect deletions by comparative genomic hybridization

被引:11
作者
Flibotte, Stephane [1 ]
Moerman, Donald G. [2 ,3 ]
机构
[1] BC Canc Agcy, Canadas Michael Smith Genome Sci Ctr, Vancouver, BC V5Z 4S6, Canada
[2] Univ British Columbia, Dept Zool, Vancouver, BC V6T 1Z4, Canada
[3] Univ British Columbia, Michael Smith Labs, Vancouver, BC V6T 1Z4, Canada
关键词
D O I
10.1186/1471-2164-9-497
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Microarray comparative genomic hybridization (CGH) is currently one of the most powerful techniques to measure DNA copy number in large genomes. In humans, microarray CGH is widely used to assess copy number variants in healthy individuals and copy number aberrations associated with various diseases, syndromes and disease susceptibility. In model organisms such as Caenorhabditis elegans (C. elegans) the technique has been applied to detect mutations, primarily deletions, in strains of interest. Although various constraints on oligonucleotide properties have been suggested to minimize non-specific hybridization and improve the data quality, there have been few experimental validations for CGH experiments. For genomic regions where strict design filters would limit the coverage it would also be useful to quantify the expected loss in data quality associated with relaxed design criteria. Results: We have quantified the effects of filtering various oligonucleotide properties by measuring the resolving power for detecting deletions in the human and C. elegans genomes using NimbleGen microarrays. Approximately twice as many oligonucleotides are typically required to be affected by a deletion in human DNA samples in order to achieve the same statistical confidence as one would observe for a deletion in C. elegans. Surprisingly, the ability to detect deletions strongly depends on the oligonucleotide 15-mer count, which is defined as the sum of the genomic frequency of all the constituent 15-mers within the oligonucleotide. A similarity level above 80% to non-target sequences over the length of the probe produces significant cross-hybridization. We recommend the use of a fairly large melting temperature window of up to 10 C, the elimination of repeat sequences, the elimination of homopolymers longer than 5 nucleotides, and a threshold of -1 kcal/mol on the oligonucleotide self-folding energy. We observed very little difference in data quality when varying the oligonucleotide length between 50 and 70, and even when using an isothermal design strategy. Conclusion: We have determined experimentally the effects of varying several key oligonucleotide microarray design criteria for detection of deletions in C. elegans and humans with NimbleGen's CGH technology. Our oligonucleotide design recommendations should be applicable for CGH analysis in most species.
引用
收藏
页数:12
相关论文
共 29 条
[1]   Design considerations for array CGH to oligonucleotide Arrays [J].
Baldocchi, RA ;
Glynne, RJ ;
Chin, K ;
Kowbel, D ;
Collins, C ;
Mack, DH ;
Gray, JW .
CYTOMETRY PART A, 2005, 67A (02) :129-136
[2]   Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data [J].
Baross, Agnes ;
Delaney, Allen D. ;
Li, H. Irene ;
Nayar, Tarun ;
Flibotte, Stephane ;
Qian, Hong ;
Chan, Susanna Y. ;
Asano, Jennifer ;
Ally, Adrian ;
Cao, Manqiu ;
Birch, Patricia ;
Brown-John, Mabel ;
Fernandes, Nicole ;
Go, Anne ;
Kennedy, Giulia ;
Langlois, Sylvie ;
Eydoux, Patrice ;
Friedman, J. M. ;
Marra, Marco A. .
BMC BIOINFORMATICS, 2007, 8 (1)
[3]   High-resolution analysis of DNA copy number using oligonucleotide microarrays [J].
Bignell, GR ;
Huang, J ;
Greshock, J ;
Watt, S ;
Butler, A ;
West, S ;
Grigorova, M ;
Jones, KW ;
Wei, W ;
Stratton, MR ;
Futreal, PA ;
Weber, B ;
Shapero, MH ;
Wooster, R .
GENOME RESEARCH, 2004, 14 (02) :287-295
[4]   PREDICTING DNA DUPLEX STABILITY FROM THE BASE SEQUENCE [J].
BRESLAUER, KJ ;
FRANK, R ;
BLOCKER, H ;
MARKY, LA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1986, 83 (11) :3746-3750
[5]   Gene Expression Omnibus: NCBI gene expression and hybridization array data repository [J].
Edgar, R ;
Domrachev, M ;
Lash, AE .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :207-210
[6]   Recurrent DNA copy number variation in the laboratory mouse [J].
Egan, Chris M. ;
Sridhar, Srinath ;
Wigler, Michael ;
Hall, Ira M. .
NATURE GENETICS, 2007, 39 (11) :1384-1389
[7]   Copy number variation: New insights in genome diversity [J].
Freeman, Jennifer L. ;
Perry, George H. ;
Feuk, Lars ;
Redon, Richard ;
McCarroll, Steven A. ;
Altshuler, David M. ;
Aburatani, Hiroyuki ;
Jones, Keith W. ;
Tyler-Smith, Chris ;
Hurles, Matthew E. ;
Carter, Nigel P. ;
Scherer, Stephen W. ;
Lee, Charles .
GENOME RESEARCH, 2006, 16 (08) :949-961
[8]   Optimized design and assessment of whole genome tiling arrays [J].
Graef, Stefan ;
Nielsen, Fiona G. G. ;
Kurtz, Stefan ;
Huynen, Martijn A. ;
Birney, Ewan ;
Stunnenberg, Henk ;
Flicek, Paul .
BIOINFORMATICS, 2007, 23 (13) :I195-I204
[9]   ALEXA: a microarray design platform for alternative expression analysis [J].
Griffith, Malachi ;
Tang, Michelle J. ;
Griffith, Obi L. ;
Morin, Ryan D. ;
Chan, Susanna Y. ;
Asano, Jennifer K. ;
Zeng, Thomas ;
Flibotte, Stephane ;
Ally, Adrian ;
Baross, Agnes ;
Hirst, Martin ;
Jones, Steven J. M. ;
Morin, Gregg B. ;
Tai, Isabella T. ;
Marra, Marco A. .
NATURE METHODS, 2008, 5 (02) :118-118
[10]   Detection of large-scale variation in the human genome [J].
Iafrate, AJ ;
Feuk, L ;
Rivera, MN ;
Listewnik, ML ;
Donahoe, PK ;
Qi, Y ;
Scherer, SW ;
Lee, C .
NATURE GENETICS, 2004, 36 (09) :949-951