Significance analysis of lexical bias in microarray data

被引:23
作者
Kim, CC [1 ]
Falkow, S [1 ]
机构
[1] Stanford Univ, Med Ctr, Stanford, CA 94305 USA
关键词
D O I
10.1186/1471-2105-4-12
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Genes that are determined to be significantly differentially regulated in microarray analyses often appear to have functional commonalities, such as being components of the same biochemical pathway. This results in certain words being under- or overrepresented in the list of genes. Distinguishing between biologically meaningful trends and artifacts of annotation and analysis procedures is of the utmost importance, as only true biological trends are of interest for further experimentation. A number of sophisticated methods for identification of significant lexical trends are currently available, but these methods are generally too cumbersome for practical use by most microarray users. Results: We have developed a tool, LACK, for calculating the statistical significance of apparent lexical bias in microarray datasets. The frequency of a user-specified list of search terms in a list of genes which are differentially regulated is assessed for statistical significance by comparison to randomly generated datasets. The simplicity of the input files and user interface targets the average microarray user who wishes to have a statistical measure of apparent lexical trends in analyzed datasets without the need for bioinformatics skills. The software is available as Perl source or a Windows executable. Conclusion: We have used LACK in our laboratory to generate biological hypotheses based on our microarray data. We demonstrate the program's utility using an example in which we confirm significant upregulation of SP1-2 pathogenicity island of Salmonella enterica serovar Typhimurium by the cation chelator dipyridyl.
引用
收藏
页数:6
相关论文
共 16 条
  • [1] Whole-genome expression analysis: challenges beyond clustering
    Altman, RB
    Raychaudhuri, S
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2001, 11 (03) : 340 - 347
  • [2] Mining functional information associated with expression arrays
    Blaschke C.
    Oliveros J.C.
    Valencia A.
    [J]. Functional & Integrative Genomics, 2001, 1 (4) : 256 - 268
  • [3] Genomic comparison of Salmonella enterica serovars and Salmonella bongori by use of an S-enterica serovar Typhimurium DNA microarray
    Chan, K
    Baker, S
    Kim, CC
    Detweiler, CS
    Dougan, G
    Falkow, S
    [J]. JOURNAL OF BACTERIOLOGY, 2003, 185 (02) : 553 - 563
  • [4] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868
  • [5] Genes encoding putative effector proteins of the type III secretion system of Salmonella pathogenicity island 2 are required for bacterial virulence and proliferation in macrophages
    Hensel, M
    Shea, JE
    Waterman, SR
    Mundy, R
    Nikolaus, T
    Banks, G
    Vazquez-Torres, A
    Gleeson, C
    Fang, FC
    Holden, DW
    [J]. MOLECULAR MICROBIOLOGY, 1998, 30 (01) : 163 - 174
  • [6] A literature network of human genes for high-throughput analysis of gene expression
    Jenssen, TK
    Lægreid, A
    Komorowski, J
    Hovig, E
    [J]. NATURE GENETICS, 2001, 28 (01) : 21 - +
  • [7] Practical approaches to analyzing results of microarray experiments
    Kaminski, N
    Friedman, N
    [J]. AMERICAN JOURNAL OF RESPIRATORY CELL AND MOLECULAR BIOLOGY, 2002, 27 (02) : 125 - 132
  • [8] Liu X, 2001, Pac Symp Biocomput, P127
  • [9] Use of keyword hierarchies to interpret gene expression patterns
    Masys, DR
    Welsh, JB
    Fink, JL
    Gribskov, M
    Klacansky, I
    Corbeil, J
    [J]. BIOINFORMATICS, 2001, 17 (04) : 319 - 326
  • [10] Complete genome sequence of Salmonella enterica serovar typhimurium LT2
    McClelland, M
    Sanderson, KE
    Spieth, J
    Clifton, SW
    Latreille, P
    Courtney, L
    Porwollik, S
    Ali, J
    Dante, M
    Du, FY
    Hou, SF
    Layman, D
    Leonard, S
    Nguyen, C
    Scott, K
    Holmes, A
    Grewal, N
    Mulvaney, E
    Ryan, E
    Sun, H
    Florea, L
    Miller, W
    Stoneking, T
    Nhan, M
    Waterston, R
    Wilson, RK
    [J]. NATURE, 2001, 413 (6858) : 852 - 856