TXTGate: profiling gene groups with text-based information

Cited by: 48
Authors
Glenisson, P [1]
Coessens, B [1]
Van Vooren, S [1]
Mathys, J [1]
Moreau, Y [1]
De Moor, B [1]
Affiliations
[1] Katholieke Univ Leuven, Dept Elektrotech ESAT, Fac Toegepaste Wetenschappen, B-3001 Heverlee, Belgium
DOI
10.1186/gb-2004-5-6-r43
Chinese Library Classification (CLC)
Q81 [Bioengineering (biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
We implemented a framework called TXTGate that combines literature indices of selected public biological resources in a flexible text-mining system designed towards the analysis of groups of genes. By means of tailored vocabularies, term- as well as gene-centric views are offered on selected textual fields and MEDLINE abstracts used in LocusLink and the Saccharomyces Genome Database. Subclustering and links to external resources allow for in-depth analysis of the resulting term profiles.
Pages: 12