Index Structures for Fast Similarity Search for Real Vectors. II*

被引:4
|
作者
Rachkovskij, D. A. [1 ,2 ]
机构
[1] NAS Ukraine, Int Sci Educ Ctr Informat Technol & Syst, Kiev, Ukraine
[2] MES Ukraine, Kiev, Ukraine
关键词
similarity search; nearest neighbor; near neighbor; index structure; branch and bound method; tree and forest; clustering; proximity graph; locality-sensitive hashing;
D O I
10.1007/s10559-018-0034-z
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This survey article considers index structures for fast similarity search for objects represented by real-valued vectors. Structures for both exact and faster but approximate similarity search are considered. Index structures based on partitioning into regions (including hierarchical ones) and on proximity graphs are mainly presented. The acceleration of similarity search using the transformation of initial data is also discussed. The ideas of concrete algorithms including recently proposed ones are outlined. The approaches to the acceleration of similarity search in index structures of the considered types and also on the basis of similarity-preserving hashing are discussed and compared.
引用
收藏
页码:320 / 335
页数:16
相关论文
共 50 条
  • [41] Fast Adaptive Similarity Search through Variance-Aware Quantization
    Paparrizos, John
    Edian, Ikraduya
    Liu, Chunwei
    Elmore, Aaron J.
    Franklin, Michael J.
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2969 - 2983
  • [42] A fast and scalable similarity search in high-dimensional image datasets
    Hanyf Y.
    Silkan H.
    International Journal of Computer Applications in Technology, 2019, 59 (01): : 95 - 104
  • [43] FRORSS: Fast Result Object Retrieval using Similarity Search on Cloud
    Raghavendra, S.
    Nithyashree, K.
    Geeta, C. M.
    Buyya, Rajkumar
    Venugopal, K. R.
    Iyengar, S. S.
    Patnaik, L. M.
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING, VLSI, ELECTRICAL CIRCUITS AND ROBOTICS (DISCOVER), 2016, : 107 - 112
  • [44] Fast and Flexible Top-k Similarity Search on Large Networks
    Zhang, Jing
    Tang, Jie
    Ma, Cong
    Tong, Hanghang
    Jing, Yu
    Li, Juanzi
    Luyten, Walter
    Moens, Marie-Francine
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2017, 36 (02)
  • [45] A fast and scalable similarity search in high-dimensional image datasets
    Hanyf, Youssef
    Silkan, Hassan
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2019, 59 (01) : 95 - 104
  • [46] A fast malware detection model based on heterogeneous graph similarity search
    Li, Tun
    Shou, Peng
    Wan, Xin
    Li, Qian
    Wang, Rong
    Jia, Chaolong
    Xiao, Yunpeng
    COMPUTER NETWORKS, 2024, 254
  • [47] Metric Index: An efficient and scalable solution for precise and approximate similarity search
    Novak, David
    Batko, Michal
    Zezula, Pavel
    INFORMATION SYSTEMS, 2011, 36 (04) : 721 - 733
  • [48] Effectiveness of NAQ-tree as index structure for similarity search in high-dimensional metric space
    Zhang, Ming
    Alhajj, Reda
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 22 (01) : 1 - 26
  • [49] Real-Valued Embeddings and Sketches for Fast Distance and Similarity Estimation
    Rachkovskij, D. A.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2016, 52 (06) : 967 - 988
  • [50] A log square average case algorithm to make insertions in fast similarity search
    Mico, Luisa
    Oncina, Jose
    PATTERN RECOGNITION LETTERS, 2012, 33 (09) : 1060 - 1065