An automated real-time integration and interoperability framework for bioinformatics

Cited by: 7
Authors
Lopes, Pedro [1]
Oliveira, Jose Luis [1]
Affiliations
[1] Univ Aveiro, DETI IEETA, P-3810193 Aveiro, Portugal
Source
BMC BIOINFORMATICS | 2015 / Volume 16
Keywords
Data integration; Interoperability; Publish/subscribe; Integration-as-a-service; Intelligent ETL; Workflow; Cloud; Service-oriented architecture; Event-driven; WEB SERVICES; PLATFORM; GALAXY; GENE;
DOI
10.1186/s12859-015-0761-3
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: In recent years, data integration has become an everyday undertaking for life sciences researchers. Aggregating and processing data from disparate sources, whether through purpose-built software or via manual processes, is a common task for scientists. However, the scope and usability of most current integration tools fail to cope with the fast-growing and highly dynamic nature of biomedical data.

Results: In this work we introduce a reactive and event-driven framework that simplifies real-time data integration and interoperability. This platform facilitates otherwise difficult tasks, such as connecting heterogeneous services; indexing, linking and transferring data from distinct resources; or subscribing to notifications regarding the timeliness of dynamic data. For developers, the framework automates the deployment of integrative and interoperable bioinformatics applications, using atomic data storage for content change detection and enabling agent-based intelligent extract, transform and load tasks.

Conclusions: This work bridges the gap between the growing number of services that access specific data sources or algorithms and the growing number of users who perform simple integration tasks on a recurring basis, through a streamlined workspace available to researchers and developers alike.
Pages: 13