Dynamic integration of biological data sources using the data concierge

被引:4
作者
Gong P. [1 ,2 ]
机构
[1] Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, The University of Sydney, Sydney
[2] Department of PET and Nuclear Medicine, RPA Hospital, Camperdown
关键词
Biology; Data integration; Middleware; Ontology; State machine;
D O I
10.1186/2047-2501-1-7
中图分类号
学科分类号
摘要
Background: The ever-changing landscape of large-scale network environments and innovative biology technologies require dynamic mechanisms to rapidly integrate previously unknown bioinformatics sources at runtime. However, existing integration technologies lack sufficient flexibility to adapt to these changes, because the techniques used for integration are static, and sensitive to new or changing bioinformatics source implementations and evolutionary biologist requirements.Methods: To address this challenge, in this paper we propose a new semantics-based adaptive middleware, the Data Concierge, which is able to dynamically integrate heterogeneous biological data sources without the need for wrappers. Along with the architecture necessary to facilitate dynamic integration, API description mechanism is proposed to dynamically classify, recognize, locate, and invoke newly added biological data source functionalities. Based on the unified semantic metadata, XML-based state machines are able to provide flexible configurations to execute biologist's abstract and complex operations.Results and discussion: Experimental results demonstrate that for obtaining dynamic features, the Data Concierge sacrifices reasonable performance on reasoning knowledge models and dynamically doing data source API invocations. The overall costs to integrate new biological data sources are significantly lower when using the Data Concierge.Conclusions: The Data Concierge facilitates the rapid integration of new biological data sources in existing applications with no repetitive software development required, and hence, this mechanism would provide a cost-effective solution to the labor-intensive software engineering tasks. © 2013 Gong; licensee BioMed Central Ltd.
引用
收藏
相关论文
共 49 条
[1]  
Baxevanis A.D., The molecular biology database collection: 2003 update, Nucleic Acids Res, 31, pp. 1-12, (2003)
[2]  
Stocks K.I., Condit C., Qian X., Brewin P.E., Gupta A., Bringing together an ocean of information: an extensible data integration framework for biological oceanography, Deep-Sea Research II, 56, pp. 1804-1811, (2009)
[3]  
Goble C., Stevens R., State of the nation in data integration for bioinformatics, J Biomed Inform, 41, pp. 687-693, (2008)
[4]  
Sullivan D.E., Gabbard J.L., Shukla M., Sobral B., Data integration for dynamic and sustainable systems biology resources: challenges and lessons learned, Chem Biodivers, 7, pp. 1124-1141, (2010)
[5]  
Hernandez T., Kambhampati S., Integration of biological sources: current systems and challenges ahead, In ACM SIGMOD Record, ACM Press, 33, pp. 51-60, (2004)
[6]  
Mougin F., Burgun A., Loreal O., Beux P.L., Towards the automatic generation of biomedical sources schema, Medinfo, 11, pp. 783-787, (2004)
[7]  
Stevens R., Goble C.A., Baker P., Brass A., A classification of tasks in bioinformatics, Bioinformatics, 17, pp. 180-188, (2001)
[8]  
Achard F., Vaysseix G., Barillot E., XML, bioinformatics, and data integration, Bioinformatics Review, 17, pp. 115-125, (2001)
[9]  
Aldana J.F., Roldan-Castro M., Navas I., Roldan-Garcia M.M., Hidalgo-Conde M., Trelles O., Bio-Broker: a tool for integration of biological data sources and data analysis tools, Software - Practice and Experience, 36, pp. 1585-1604, (2006)
[10]  
Finn R.D., Gardner P.P., Bateman A., Making your database available through Wikipedia: the pros and cons, Nucl. Acids Res., 40, (2012)