A Java']Java-based tool for the design of classification microarrays

被引:2
作者
Meng, Da [1 ]
Broschat, Shira L. [1 ,2 ,3 ]
Call, Douglas R. [2 ,3 ]
机构
[1] Washington State Univ, Sch Elect Engn & Comp Sci, Pullman, WA 99164 USA
[2] Washington State Univ, Dept Vet Microbiol & Pathol, Pullman, WA 99164 USA
[3] Washington State Univ, Ctr Integrated Biotechnol, Pullman, WA 99164 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1186/1471-2105-9-328
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Classification microarrays are used for purposes such as identifying strains of bacteria and determining genetic relationships to understand the epidemiology of an infectious disease. For these cases, mixed microarrays, which are composed of DNA from more than one organism, are more effective than conventional microarrays composed of DNA from a single organism. Selection of probes is a key factor in designing successful mixed microarrays because redundant sequences are inefficient and limited representation of diversity can restrict application of the microarray. We have developed a Java-based software tool, called PLASMID, for use in selecting the minimum set of probe sequences needed to classify different groups of plasmids or bacteria. Results: The software program was successfully applied to several different sets of data. The utility of PLASMID was illustrated using existing mixed-plasmid microarray data as well as data from a virtual mixed-genome microarray constructed from different strains of Streptococcus. Moreover, use of data from expression microarray experiments demonstrated the generality of PLASMID. Conclusion: In this paper we describe a new software tool for selecting a set of probes for a classification microarray. While the tool was developed for the design of mixed microarrays-and mixed-plasmid microarrays in particular-it can also be used to design expression arrays. The user can choose from several clustering methods (including hierarchical, non-hierarchical, and a model-based genetic algorithm), several probe ranking methods, and several different display methods. A novel approach is used for probe redundancy reduction, and probe selection is accomplished via stepwise discriminant analysis. Data can be entered in different formats (including Excel and comma-delimited text), and dendrogram, heat map, and scatter plot images can be saved in several different formats (including jpeg and tiff). Weights generated using stepwise discriminant analysis can be stored for analysis of subsequent experimental data. Additionally, PLASMID can be used to construct virtual microarrays with genomes from public databases, which can then be used to identify an optimal set of probes.
引用
收藏
页数:11
相关论文
共 19 条
[1]   Discrimination among Listeria monocytogenes isolates using a mixed genome DNA microarray [J].
Borucki, MK ;
Krug, MJ ;
Wayne, WT ;
Call, DR .
VETERINARY MICROBIOLOGY, 2003, 92 (04) :351-362
[2]   Assessing genetic diversity in plasmids from Escherichia coli and Salmonella enterica using a mixed-plasmid microarray [J].
Call, DR ;
Kang, MS ;
Daniels, J ;
Besser, TE .
JOURNAL OF APPLIED MICROBIOLOGY, 2006, 100 (01) :15-28
[3]   Mixed-genome microarrays reveal multiple serotype and lineage-specific differences among strains of Listeria monocytogenes [J].
Call, DR ;
Borucki, MK ;
Besser, TE .
JOURNAL OF CLINICAL MICROBIOLOGY, 2003, 41 (02) :632-639
[4]   Selecting genes by test statistics [J].
Chen, DC ;
Liu, ZQ ;
Ma, XB ;
Hua, D .
JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2005, (02) :132-138
[5]   Optimization of probe length and the number of probes per gene for optimal microarray analysis of gene expression [J].
Chou, CC ;
Chen, CH ;
Lee, TT ;
Peck, K .
NUCLEIC ACIDS RESEARCH, 2004, 32 (12) :e99
[6]   PROBEmer: a web-based software tool for selecting optimal DNA oligos [J].
Emrich, SJ ;
Lowe, M ;
Delcher, AL .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3746-3750
[7]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[8]  
HANCZAR B, 2003, SIGKDD EXPLORATIONS, V5, P23, DOI DOI 10.1145/980972.980977
[9]   Genome-wide selection of unique and valid oligonucleotides -: art. no. e115 [J].
Hyyrö, H ;
Juhola, M ;
Vihinen, M .
NUCLEIC ACIDS RESEARCH, 2005, 33 (13) :1-16
[10]  
Jaeger J, 2003, Pac Symp Biocomput, P53