Use of keyword hierarchies to interpret gene expression patterns

Cited by: 84
Authors
Masys, DR
Welsh, JB
Fink, JL
Gribskov, M
Klacansky, I
Corbeil, J
Affiliations
[1] Univ Calif San Diego, Sch Med, Ctr Canc, Dept Med, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Ctr Canc, Dept Pathol, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Ctr Canc, Dept Biol, La Jolla, CA 92093 USA
[4] San Diego Supercomp Ctr, San Diego, CA 92161 USA
[5] Vet Med Res Fdn, San Diego, CA 92161 USA
DOI
10.1093/bioinformatics/17.4.319
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches is available to identify clusters of genes that share common expression characteristics, but these approaches provide no information regarding the biological similarities of genes within clusters. The published literature provides a potential source of information to assist in the interpretation of clustering results.

Results: We describe a data mining method that uses indexing terms ('keywords') from the published literature linked to specific genes to present a view of the conceptual similarity of genes within a cluster or group of interest. The method takes advantage of the hierarchical nature of the Medical Subject Headings used to index citations in the MEDLINE database, and of the registry numbers applied to enzymes.
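To make the idea of exploiting a keyword hierarchy concrete, the following is a minimal sketch, not the authors' implementation. It assumes each gene in a cluster has already been linked to hierarchical codes in the dot-separated style of MeSH tree numbers, where truncating the last component yields the parent term. The gene names, the tree numbers, and the helper functions `ancestors` and `rollup` are hypothetical placeholders introduced for illustration only.

```python
from collections import defaultdict


def ancestors(tree_number):
    """Return the tree number and all of its ancestors, most specific first."""
    parts = tree_number.split(".")
    return [".".join(parts[:i]) for i in range(len(parts), 0, -1)]


def rollup(gene_to_tree_numbers):
    """For each hierarchy node, count the genes indexed at or below that node."""
    genes_per_node = defaultdict(set)
    for gene, tree_numbers in gene_to_tree_numbers.items():
        for tree_number in tree_numbers:
            for node in ancestors(tree_number):
                genes_per_node[node].add(gene)
    return {node: len(genes) for node, genes in genes_per_node.items()}


# Hypothetical cluster: gene names and codes are placeholders, not real annotations.
cluster = {
    "geneA": ["C04.557.337", "D08.811"],
    "geneB": ["C04.557.470"],
    "geneC": ["C04.588"],
}

counts = rollup(cluster)

# Print the hierarchy as an indented outline, one level of indent per dot.
for node in sorted(counts):
    print("  " * node.count(".") + f"{node}: {counts[node]} gene(s)")
```

In this toy cluster no two genes share a leaf-level code, yet all three fall under the same top-level branch, which is the kind of conceptual similarity a hierarchy can expose and a flat keyword list cannot. The same prefix roll-up could plausibly be applied to enzyme registry numbers, assuming they are hierarchical dot-separated codes such as Enzyme Commission numbers.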
Pages: 319-326
Page count: 8