PLAN2L: a web tool for integrated text mining and literature-based bioentity relation extraction

被引:23
作者
Krallinger, Martin [1 ]
Rodriguez-Penagos, Carlos [2 ]
Tendulkar, Ashish [1 ]
Valencia, Alfonso [1 ]
机构
[1] Spanish Natl Canc Ctr CNIO, Struct Biol & Biocomp Programme, Madrid 28029, Spain
[2] Barcelona Media Ctr Innovacio, Barcelona, Spain
关键词
ARABIDOPSIS GENOME; ANNOTATION;
D O I
10.1093/nar/gkp484
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
There is an increasing interest in using literature mining techniques to complement information extracted from annotation databases or generated by bioinformatics applications. Here we present PLAN2L, a web-based online search system that integrates text mining and information extraction techniques to access systematically information useful for analyzing genetic, cellular and molecular aspects of the plant model organism Arabidopsis thaliana. Our system facilitates a more efficient retrieval of information relevant to heterogeneous biological topics, from implications in biological relationships at the level of protein interactions and gene regulation, to sub-cellular locations of gene products and associations to cellular and developmental processes, i.e. cell cycle, flowering, root, leaf and seed development. Beyond single entities, also predefined pairs of entities can be provided as queries for which literature-derived relations together with textual evidences are returned. PLAN2L does not require registration and is freely accessible at http://zope.bioinfo.cnio.es/plan2l.
引用
收藏
页码:W160 / W165
页数:6
相关论文
共 12 条
  • [1] ABNEY S, 1996, J NATURAL LANGUAGE E, V2, P337
  • [2] [Anonymous], P INT C NEW METH LAN
  • [3] [Anonymous], CURR PROTOC BIOINFOR
  • [4] Dragon plant biology explorer. A text-mining tool for integrating associations between genetic and biochemical entities with genome annotation and biochemical terms lists
    Bajic, VB
    Veronika, M
    Veladandi, PS
    Meka, A
    Heng, MW
    Rajaraman, K
    Pan, H
    Swarup, S
    [J]. PLANT PHYSIOLOGY, 2005, 138 (04) : 1914 - 1925
  • [5] Functional annotation of the Arabidopsis genome using controlled vocabularies
    Berardini, TZ
    Mundodi, S
    Reiser, L
    Huala, E
    Garcia-Hernandez, M
    Zhang, PF
    Mueller, LA
    Yoon, J
    Doyle, A
    Lander, G
    Moseyko, N
    Yoo, D
    Xu, I
    Zoeckler, B
    Montoya, M
    Miller, N
    Weems, D
    Rhee, SY
    [J]. PLANT PHYSIOLOGY, 2004, 135 (02) : 745 - 755
  • [6] The Arabidopsis genome:: A foundation for plant research
    Bevan, M
    Walsh, S
    [J]. GENOME RESEARCH, 2005, 15 (12) : 1632 - 1642
  • [7] Implementing the iHOP concept for navigation of biomedical literature
    Hoffmann, R
    Valencia, A
    [J]. BIOINFORMATICS, 2005, 21 : 252 - 258
  • [8] Evaluation of text-mining systems for biology: overview of the Second BioCreative community challenge
    Krallinger, Martin
    Morgan, Alexander
    Smith, Larry
    Leitner, Florian
    Tanabe, Lorraine
    Wilbur, John
    Hirschman, Lynette
    Valencia, Alfonso
    [J]. GENOME BIOLOGY, 2008, 9
  • [9] Linking genes to literature: text mining, information extraction, and retrieval applications for biology
    Krallinger, Martin
    Valencia, Alfonso
    Hirschman, Lynette
    [J]. GENOME BIOLOGY, 2008, 9
  • [10] Textpresso:: An ontology-based information retrieval and extraction system for biological literature
    Müller, HM
    Kenny, EE
    Sternberg, PW
    [J]. PLOS BIOLOGY, 2004, 2 (11): : 1984 - 1998