Validating text mining results on protein-protein interactions using gene expression profiles

被引:0
作者
Zhou, Deyu [1 ]
He, Yulan [1 ]
Kwoh, Chee Keong [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Nanyang Ave, Singapore 639798, Singapore
来源
2006 INTERNATIONAL CONFERENCE ON BIOMEDICAL AND PHARMACEUTICAL ENGINEERING, VOLS 1 AND 2 | 2006年
关键词
D O I
暂无
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Protein-protein interactions referring to the associations of protein molecules are crucial for many biological functions. Since most knowledge about them still hides in biological publications, there is an increasing focus on mining information from the vast amount of biological literature such as MedLine. Many approaches, such as pattern matching, shallow parsing and deep parsing, have been proposed to automatically extract protein-protein interaction information from text sources, with however limited success. Moreover, to the best of our knowledge, none of the existing approaches have performed automatic validation on the mining results. In this paper, we describe a novel framework in which text mining results are automatically validated using the knowledge mined from gene expression profiles. A probability model is proposed to score the confidence of protein-protein interactions based on both text mining results and gene expression profiles. Experimental results are presented to show the feasibility of this framework.
引用
收藏
页码:577 / +
页数:3
相关论文
共 17 条
[1]   Correlation between gene expression profiles and protein-protein interactions within and across genomes [J].
Bhardwaj, N ;
Lu, H .
BIOINFORMATICS, 2005, 21 (11) :2730-2738
[2]  
GOLLUB J, 2003, NUCL ACIDS RES, V31
[3]  
GRIGORIEVA A, NUCL ACIDS RES, V29
[4]   Semantic processing using the hidden vector state model [J].
He, Y ;
Young, S .
COMPUTER SPEECH AND LANGUAGE, 2005, 19 (01) :85-106
[5]   Discovering patterns to extract protein-protein interactions from full texts [J].
Huang, ML ;
Zhu, XY ;
Hao, Y ;
Payan, DG ;
Qu, KB ;
Li, M .
BIOINFORMATICS, 2004, 20 (18) :3604-3612
[6]  
JANSEN R, 2002, RELATING WHOLE GENOM
[7]  
JOSHUA M, 2003, BIOINFORMATICS, V19, P2046
[8]   A shallow parser based on closed-class words to capture relations in biomedical text [J].
Leroy, G ;
Chen, HC ;
Martinez, JD .
JOURNAL OF BIOMEDICAL INFORMATICS, 2003, 36 (03) :145-158
[9]  
MARK C, 1999, P 7 INT C INT SYST M, P77
[10]   Automated extraction of information on protein-protein interactions from the biological literature [J].
Ono, T ;
Hishigaki, H ;
Tanigami, A ;
Takagi, T .
BIOINFORMATICS, 2001, 17 (02) :155-161