Fast and efficient searching of biological data resources - using EB-eye

Cited by: 30
Authors
Valentin, Franck
Squizzato, Silvano
Goujon, Mickael
McWilliam, Hamish
Paern, Juri
Lopez, Rodrigo
Affiliations
[1] External Service Group, EMBL-EBI
Funding
Wellcome Trust (UK); National Institutes of Health (US)
Keywords
text search; biological databases; integration; interoperability; web services; Apache Lucene; RETRIEVAL-SYSTEM
DOI
10.1093/bib/bbp065
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
The EB-eye is a fast and efficient search engine that provides easy and uniform access to the biological data resources hosted at the EMBL-EBI. Currently, users can access information from more than 62 distinct datasets covering some 400 million entries. The data resources represented in the EB-eye include nucleotide and protein sequences at both the genomic and proteomic levels, structures ranging from chemicals to macromolecular complexes, gene-expression experiments, binary-level molecular interactions as well as reaction maps and pathway models, functional classifications, biological ontologies, and comprehensive literature libraries covering the biomedical sciences and related intellectual property. The EB-eye can be accessed over the web or programmatically using a SOAP Web Services interface. This allows its search and retrieval capabilities to be exploited in workflows and analytical pipelines. The EB-eye is a novel alternative to existing biological search and retrieval engines. In this article, we describe in detail how to exploit its powerful capabilities.
Pages: 375-384
Page count: 10