Constructing a molecular interaction network for thyroid cancer via large-scale text mining of gene and pathway events

Cited by: 6
Authors
Wu, Chengkun [1]
Schwartz, Jean-Marc [2]
Brabant, Georg [3, 4]
Peng, Shao-Liang [1]
Nenadic, Goran [5, 6, 7]
Affiliations
[1] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Hunan, Peoples R China
[2] Univ Manchester, Fac Life Sci, Manchester M13 9PT, Lancs, England
[3] Univ Manchester, Christie Hosp, Dept Endocrinol, Manchester M20 4BX, Lancs, England
[4] Med Univ Lubeck, Med Clin 1, Expt & Clin Endocrinol, D-23538 Lubeck, Germany
[5] Manchester Inst Biotechnol, 131 Princess St, Manchester M1 7DN, Lancs, England
[6] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
[7] Hlth E Res Ctr HeRC, Farr Inst Hlth Informat Res, Manchester M13 9PL, Lancs, England
Keywords
IDENTIFICATION; NORMALIZATION; EXTRACTION; BIOLOGY; SYSTEM
DOI
10.1186/1752-0509-9-S6-S5
Chinese Library Classification number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Background: Biomedical researchers need automated tools and easily accessible data to cope with the rapidly accumulating literature. Text-mining tools and curated databases have been developed to meet this need, and they can be applied to improve the understanding of the molecular pathogenesis of complex diseases such as thyroid cancer.

Results: We have developed a system, PWTEES, which extracts pathway interactions from the literature using an existing event extraction tool (TEES) and a pathway named entity recognition tool (PathNER). We applied the system to a thyroid cancer corpus and systematically extracted molecular interactions involving either genes or pathways. From the extracted information, we constructed a molecular interaction network with genes and pathways as nodes. Using curated pathway information and network topological analyses, we highlight key genes and pathways involved in thyroid carcinogenesis.

Conclusions: Mining events involving genes and pathways from the literature and integrating curated pathway knowledge can help improve the understanding of molecular interactions in complex diseases. The system developed for this study can also be applied to diseases other than thyroid cancer. The source code is freely available online at https://github.com/chengkun-wu/PWTEES.
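
The network construction step described in the abstract can be illustrated with a minimal sketch. It is not the PWTEES implementation: the event tuples, the placeholder node names (GENE_A, PATHWAY_X, and so on), and the helper functions `build_network` and `rank_by_degree` are all hypothetical, and node degree is used here only as one simple stand-in for the topological analyses the abstract mentions.

```python
from collections import defaultdict

# Hypothetical extracted events, NOT real PWTEES output:
# (source, source_type, event_type, target, target_type)
EVENTS = [
    ("GENE_A", "gene", "Positive_regulation", "PATHWAY_X", "pathway"),
    ("GENE_B", "gene", "Positive_regulation", "PATHWAY_X", "pathway"),
    ("GENE_A", "gene", "Binding", "GENE_C", "gene"),
    ("GENE_D", "gene", "Negative_regulation", "PATHWAY_Y", "pathway"),
    ("GENE_B", "gene", "Positive_regulation", "PATHWAY_Y", "pathway"),
]


def build_network(events):
    """Build an undirected network whose nodes are genes and pathways."""
    node_types = {}                # node name -> "gene" or "pathway"
    neighbors = defaultdict(set)   # node name -> set of adjacent node names
    for source, source_type, _event_type, target, target_type in events:
        node_types[source] = source_type
        node_types[target] = target_type
        neighbors[source].add(target)
        neighbors[target].add(source)
    return node_types, neighbors


def rank_by_degree(neighbors):
    """Rank nodes by degree (number of distinct neighbors), ties by name."""
    degrees = ((node, len(adjacent)) for node, adjacent in neighbors.items())
    return sorted(degrees, key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    node_types, neighbors = build_network(EVENTS)
    for node, degree in rank_by_degree(neighbors):
        print(f"{node}\t{node_types[node]}\t{degree}")
```

In this toy input, nodes that take part in more distinct interactions rank higher. A real analysis would likely use a graph library and richer measures than degree, which the abstract does not specify.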
Pages: 10