Biodiversity information retrieval across networked data sets

被引:2
作者
Sarinder, K. K. S. [1 ]
Lim, L. H. S. [1 ]
Merican, A. F. [1 ]
Dimyati, K. [2 ]
机构
[1] Univ Malaya, Inst Biol Sci, Kuala Lumpur, Malaysia
[2] Univ Malaya, Dept Elect Engn, Kuala Lumpur, Malaysia
来源
ASLIB PROCEEDINGS | 2010年 / 62卷 / 4-5期
关键词
Information retrieval; Relational databases; Databases; Integration; Distributed databases;
D O I
10.1108/00012531011074744
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - Biodiversity resources are inevitably digital and stored in a wide variety of formats by researchers or stakeholders. In Malaysia, although digitizing biodiversity data has long been stressed, the interoperability of the biodiversity data is still an issue that requires attention. This is because, when data are shared, the question of copyright occurs, creating a setback among researchers wanting to promote or share data through online presentations. To solve this, the aim is to present an approach to integrate data through wrapping of datasets stored in relational databases located on networked platforms. Design/methodology/approach - The approach uses tools such as XML, PHP, ASP and HTML to integrate distributed databases in heterogeneous formats. Five current database integration systems were reviewed and all of them have common attributes such as query-oriented, using a mediator-based approach and integrating a structured data model. These common attributes were also adopted in the proposed solution. Distributed Generic Information Retrieval (DiGIR) was used as a model in designing the proposed solution. Findings - A new database integration system was developed, which is user-friendly and simple with common attributes found in current integration systems. Originality/value - The proposed system is unique in that it allows biodiversity data sharing, through the integration of biodiversity databases, hence enabling scientists to share information and generate knowledge. It also solves copyright problems by suggesting distributed warehouses, giving data owners the benefit of having their database under their own jurisdiction. It meets the requirements of querying heterogeneous and remote biodiversity databases.
引用
收藏
页码:514 / 522
页数:9
相关论文
共 11 条
[1]  
*BIOD RES CTR, 2005, DISTR GEN INF RETR
[2]   DiscoveryLink: A system for integrated access to life sciences data sources [J].
Haas, LM ;
Schwarz, PM ;
Kodali, P ;
Kotlar, E ;
Rice, JE ;
Swope, WC .
IBM SYSTEMS JOURNAL, 2001, 40 (02) :489-511
[3]  
*INFORMAX, 2001, GENOMAX
[4]  
JONES AC, 2000, P 11 INT C DAT EXP S, P981
[5]   Biological data integration: Wrapping data and tools [J].
Lacroix, Z .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2002, 6 (02) :123-128
[6]  
Limsoon Wong, 2000, Journal of Functional Programming, V10, P19, DOI 10.1017/S0956796899003585
[7]  
SARINDER KK, 2007, THESIS U MALAYA KUAL
[8]  
SARINDER KKS, 2009, MALAYSIAN J SCI, V28, P113
[9]   OMG overview: CORBA and the OMA in enterprise computing [J].
Siegel, J .
COMMUNICATIONS OF THE ACM, 1998, 41 (10) :37-43
[10]   Kleisli, its exchange format, supporting tools, and an application in protein interaction extraction [J].
Wong, L .
IEEE INTERNATIONAL SYMPOSIUM ON BIO-INFORMATICS AND BIOMEDICAL ENGINEERING, PROCEEDINGS, 2000, :21-28