Ultrafast Genomic Database Search using Layered Locality Sensitive Hashing

Cited by: 0
Authors
Chakraborty, Angana [1]
Bandyopadhyay, Sanghamitra [1]
Affiliations
[1] Indian Stat Inst, Machine Intelligence Unit, Kolkata, India
Source
PROCEEDINGS OF 2018 FIFTH INTERNATIONAL CONFERENCE ON EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT) | 2018
Keywords
Locality Sensitive Hashing; Genomic Database Search; Sequence Comparison; Sequence Alignment; ALIGNMENT; SEQUENCE;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
In this article, we demonstrate Layered Locality Sensitive Hashing for genomic sequence comparison. Algorithms based on Locality Sensitive Hashing have already proven successful for approximate nearest neighbor search in high-dimensional data. Genomic database search is the primary task in homology detection and motif identification. However, the huge size of genomes and their unknown repetitive regions make this task difficult. To tackle this problem, we introduce layered locality sensitive hashing for large-scale genomic comparisons. The proposed method reduces search time by 93.6% while producing results almost as good as the exact ones.
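The abstract does not describe how the layered scheme is built, so the following is only a minimal sketch of plain (single-layer) locality sensitive hashing applied to k-mers, using random position sampling, a classic LSH family for Hamming distance. It is not the authors' method. All names (`make_projection`, `build_index`, `candidates`), the toy sequence, and the parameters `K`, `SAMPLE`, and `TABLES` are assumptions chosen for illustration.

```python
import random
from collections import defaultdict

def make_projection(kmer_len, sample_size, rng):
    """Pick sample_size distinct k-mer positions to form one hash function."""
    return sorted(rng.sample(range(kmer_len), sample_size))

def build_index(sequence, kmer_len, projections):
    """Map (table_id, projected key) -> list of k-mer start offsets."""
    index = defaultdict(list)
    for start in range(len(sequence) - kmer_len + 1):
        kmer = sequence[start:start + kmer_len]
        for table_id, positions in enumerate(projections):
            key = "".join(kmer[p] for p in positions)
            index[(table_id, key)].append(start)
    return index

def candidates(query_kmer, index, projections):
    """Offsets whose k-mers collide with the query in at least one table."""
    hits = set()
    for table_id, positions in enumerate(projections):
        key = "".join(query_kmer[p] for p in positions)
        hits.update(index.get((table_id, key), []))
    return hits

# Hypothetical toy parameters and data, for illustration only.
rng = random.Random(0)
K, SAMPLE, TABLES = 12, 6, 8
projections = [make_projection(K, SAMPLE, rng) for _ in range(TABLES)]
database = "ACGTTGCATGCCGATAGCTAGGCTTAACGGATCCGATTACA"
index = build_index(database, K, projections)

# An exact copy of a database k-mer collides in every table.
hits = candidates(database[10:22], index, projections)
```

Two k-mers that differ in a few positions still collide in any table whose sampled positions avoid the mismatches, which is why only the returned candidates need an exact (and more expensive) comparison. Using several tables raises the chance that a near match is found at least once.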
Pages: 4