TXTGate: profiling gene groups with text-based information

Cited by: 48
Authors
Glenisson, P [1]
Coessens, B [1]
Van Vooren, S [1]
Mathys, J [1]
Moreau, Y [1]
De Moor, B [1]
Affiliations
[1] Katholieke Univ Leuven, Dept Elektrotech ESAT, Fac Toegepaste Wetenschappen, B-3001 Heverlee, Belgium
DOI
10.1186/gb-2004-5-6-r43
Chinese Library Classification (CLC) codes
Q81 [Bioengineering (biotechnology)]; Q93 [Microbiology]
Subject classification codes
071005; 0836; 090102; 100705
Abstract
We implemented a framework called TXTGate that combines literature indices of selected public biological resources in a flexible text-mining system designed towards the analysis of groups of genes. By means of tailored vocabularies, term- as well as gene-centric views are offered on selected textual fields and MEDLINE abstracts used in LocusLink and the Saccharomyces Genome Database. Subclustering and links to external resources allow for in-depth analysis of the resulting term profiles.
Pages: 12