PPLook: an automated data mining tool for protein-protein interaction

被引:12
作者
Zhang, Shao-Wu [1 ]
Li, Yao-Jun [1 ]
Xia, Li [2 ]
Pan, Quan [1 ]
机构
[1] Northwestern Polytech Univ, Inst Control & Informat, Sch Automat, Xian 710072, Peoples R China
[2] Univ So Calif, Dept Biol Sci, Mol & Computat Biol Program, Los Angeles, CA 90089 USA
基金
中国国家自然科学基金;
关键词
INFORMATION; EXTRACTION; SEARCH;
D O I
10.1186/1471-2105-11-326
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Extracting and visualizing of protein-protein interaction (PPI) from text literatures are a meaningful topic in protein science. It assists the identification of interactions among proteins. There is a lack of tools to extract PPI, visualize and classify the results. Results: We developed a PPI search system, termed PPLook, which automatically extracts and visualizes protein-protein interaction (PPI) from text. Given a query protein name, PPLook can search a dataset for other proteins interacting with it by using a keywords dictionary pattern-matching algorithm, and display the topological parameters, such as the number of nodes, edges, and connected components. The visualization component of PPLook enables us to view the interaction relationship among the proteins in a three-dimensional space based on the OpenGL graphics interface technology. PPLook can also provide the functions of selecting protein semantic class, counting the number of semantic class proteins which interact with query protein, counting the literature number of articles appearing the interaction relationship about the query protein. Moreover, PPLook provides heterogeneous search and a user-friendly graphical interface. Conclusions: PPLook is an effective tool for biologists and biosystem developers who need to access PPI information from the literature. PPLook is freely available for non-commercial users at http://meta.usc.edu/softs/PPLook.
引用
收藏
页数:6
相关论文
共 23 条
[1]  
[Anonymous], 1993, COMPUT LINGUIST, DOI DOI 10.21236/ADA273556
[2]  
Blaschke C, 2002, IEEE INTELL SYST, V17, P14, DOI 10.1109/MIS.2002.999215
[3]   Joining the results of heterogeneous search engines [J].
Braga, Daniele ;
Campi, Alessandro ;
Ceri, Stefano ;
Raffio, Alessandro .
INFORMATION SYSTEMS, 2008, 33 (7-8) :658-680
[4]   The BioGRID interaction database:: 2008 update [J].
Breitkreutz, Bobby-Joe ;
Stark, Chris ;
Reguly, Teresa ;
Boucher, Lorrie ;
Breitkreutz, Ashton ;
Livstone, Michael ;
Oughtred, Rose ;
Lackner, Daniel H. ;
Bahler, Jurg ;
Wood, Valerie ;
Dolinski, Kara ;
Tyers, Mike .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D637-D640
[5]   MINT: the molecular INTeraction database [J].
Chatr-aryamontri, Andrew ;
Ceol, Arnaud ;
Palazzi, Luisa Montecchi ;
Nardelli, Giuliano ;
Schneider, Maria Victoria ;
Castagnoli, Luisa ;
Cesareni, Gianni .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D572-D574
[6]  
Chernov S, 2006, LECT NOTES COMPUT SC, V4312, P202
[7]   Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information [J].
Cooper, JW ;
Kershenbaum, A .
BMC BIOINFORMATICS, 2005, 6 (1)
[8]   Extracting human protein interactions from MEDLINE using a full-sentence parser [J].
Daraselia, N ;
Yuryev, A ;
Egorov, S ;
Novichkova, S ;
Nikitin, A ;
Mazo, I .
BIOINFORMATICS, 2004, 20 (05) :604-U43
[9]  
EOM JH, 2004, GENOMICS INFORMATICS, P99
[10]   iHOP web services [J].
Fernandez, Jose M. ;
Hoffmann, Robert ;
Valencia, Alfonso .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W21-W26