An Efficient Interval Query Algorithm Based on Inverted List in Cloud Environment

被引:0
作者
Wang, Zhiqiong [1 ]
Gong, Ke [1 ]
Jin, Shikai [1 ]
Li, Wenjun [1 ]
Liu, Zixi [1 ]
机构
[1] Northeastern Univ, Sinodutch Biomed & Informat Engn Sch, Shenyang 110819, Peoples R China
来源
PROCEEDING OF THE IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION | 2012年
关键词
Cloud Computing; Inverted List; Interval Overlap Query; Performance;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Interval overlap query has played a more and more significant role in genomics researches and the development of biomedicine. However, traditional query approches based on single computer cannot handle the problem of limited query speed in the query process properly. A new algorithm based on cloud computing technology named CNCList+ has been proposed to increase the query speed. Nevertheless, the mechanism of CNCList+ that it needs to scan the data of subgroups orderly in every query process reduces the degree of query speed enhancement. Considering the significant role of inverted list in data idex area, the concept of inverted list and the technique of cloud computing are combined together in this paper, forming an efficient query algorithm named IQIL to futher speed up the query speed. In addition, detailed comparison experiments between IQIL and CNCList+ prove the superior performance of IQIL on query speed, thus demonstrating the extraordinary ability of IQIL on solving the limited query speed problem of interval overlap query.
引用
收藏
页码:221 / 225
页数:5
相关论文
共 13 条
[1]   Nested containment list (NCList): a new algorithm for accelerating interval query of genome alignment and interval databases [J].
Alekseyenko, Alexander V. ;
Lee, Christopher J. .
BIOINFORMATICS, 2007, 23 (11) :1386-1393
[2]  
Enderle J., 2004, Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, P683, DOI DOI 10.1145/1007568.1007645
[3]   GALA, a database for genomic sequence alignments and annotations [J].
Giardine, B ;
Elnitski, L ;
Riemer, C ;
Makalowska, L ;
Schwartz, S ;
Miller, W ;
Hardison, RC .
GENOME RESEARCH, 2003, 13 (04) :732-741
[4]  
Guo L, 2005, PROC INT CONF DATA, P298
[5]   The human genome browser at UCSC [J].
Kent, WJ ;
Sugnet, CW ;
Furey, TS ;
Roskin, KM ;
Pringle, TH ;
Zahler, AM ;
Haussler, D .
GENOME RESEARCH, 2002, 12 (06) :996-1006
[6]  
Khancome C., 2009, P INT C COMP TECHN D
[7]  
Kolovson C. P., 1991, SIGMOD Record, V20, P138, DOI 10.1145/119995.115807
[8]  
KRIEGEL H., 2000, Proc. 26th Int. Conf. on Very Large Databases (VLDB), P407
[9]   Combination tree for mining frequent patterns based on inverted list [J].
Liu Yong ;
Hu Yun-Fa .
2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, :805-808
[10]  
Liu Y, 2006, PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, P1320