An automated real-time integration and interoperability framework for bioinformatics

被引:7
作者
Lopes, Pedro [1 ]
Oliveira, Jose Luis [1 ]
机构
[1] Univ Aveiro, DETI IEETA, P-3810193 Aveiro, Portugal
关键词
Data integration; Interoperability; Publish/subscribe; Integration-as-a-service; Intelligent ETL; Workflow; Cloud; Service-oriented architecture; Event-driven; WEB SERVICES; PLATFORM; GALAXY; GENE;
D O I
10.1186/s12859-015-0761-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: In recent years data integration has become an everyday undertaking for life sciences researchers. Aggregating and processing data from disparate sources, whether through specific developed software or via manual processes, is a common task for scientists. However, the scope and usability of the majority of current integration tools fail to deal with the fast growing and highly dynamic nature of biomedical data. Results: In this work we introduce a reactive and event-driven framework that simplifies real-time data integration and interoperability. This platform facilitates otherwise difficult tasks, such as connecting heterogeneous services, indexing, linking and transferring data from distinct resources, or subscribing to notifications regarding the timeliness of dynamic data. For developers, the framework automates the deployment of integrative and interoperable bioinformatics applications, using atomic data storage for content change detection, and enabling agent-based intelligent extract, transform and load tasks. Conclusions: This work bridges the gap between the growing number of services, accessing specific data sources or algorithms, and the growing number of users, performing simple integration tasks on a recurring basis, through a streamlined workspace available to researchers and developers alike.
引用
收藏
页数:13
相关论文
共 43 条
[1]   An agent- and ontology-based system for integrating public gene, protein, and disease databases [J].
Alonso-Calvo, R. ;
Maojo, V. ;
Billhardt, H. ;
Martin-Sanchez, F. ;
Garcia-Remesal, M. ;
Perez-Rey, D. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (01) :17-29
[2]  
[Anonymous], 2004, Service-oriented architecture
[3]  
[Anonymous], 2005, P AG SEM AAAI FALL S
[4]  
[Anonymous], 2008, 12 ENT DISTR OBJ COM, DOI DOI 10.1109/EDOCW.2008.14
[5]   Wrangling Galaxy's reference data [J].
Blankenberg, Daniel ;
Johnson, James E. ;
Taylor, James ;
Nekrutenko, Anton .
BIOINFORMATICS, 2014, 30 (13) :1917-1919
[6]   Business Integration as a Service [J].
Chang, Victor ;
Walters, Robert John ;
Wills, Gary Brian .
INTERNATIONAL JOURNAL OF CLOUD APPLICATIONS AND COMPUTING, 2012, 2 (01) :16-40
[7]   Frontiers of Real-Time Data Analysis [J].
Croushore, Dean .
JOURNAL OF ECONOMIC LITERATURE, 2011, 49 (01) :72-100
[8]  
Darmont J., 2007, ARCHITECTURE FRAMEWO
[9]   In silico research in the era of cloud computing [J].
Dudley, Joel T. ;
Butte, Atul J. .
NATURE BIOTECHNOLOGY, 2010, 28 (11) :1181-1185
[10]   Cloud Technologies for Bioinformatics Applications [J].
Ekanayake, Jaliya ;
Gunarathne, Thilina ;
Qiu, Judy .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (06) :998-1011