Query Filtering with Low-Dimensional Local Embeddings

被引:1
|
作者
Chavez, Edgar [1 ]
Connor, Richard [2 ]
Vadicamo, Lucia [3 ]
机构
[1] Ctr Invest Cient & Educ Super Ensenada CICESE, Ensenada, Baja California, Mexico
[2] Univ Stirling, Div Math & Comp Sci, Stirling, Scotland
[3] CNR, Inst Informat Sci & Technol ISTI, Via Moruzzi 1, I-56124 Pisa, Italy
来源
SIMILARITY SEARCH AND APPLICATIONS (SISAP 2019) | 2019年 / 11807卷
关键词
Metric search; Extreme pivoting; Supermetric space; Four-point property; Pivot based index;
D O I
10.1007/978-3-030-32047-8_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The concept of local pivoting is to partition a metric space so that each element in the space is associated with precisely one of a fixed set of reference objects or pivots. The idea is that each object of the data set is associated with the reference object that is best suited to filter that particular object if it is not relevant to a query, maximising the probability of excluding it from a search. The notion does not in itself lead to a scalable search mechanism, but instead gives a good chance of exclusion based on a tiny memory footprint and a fast calculation. It is therefore most useful in contexts where main memory is at a premium, or in conjunction with another, scalable, mechanism. In this paper we apply similar reasoning to metric spaces which possess the four-point property, which notably include Euclidean, Cosine, Triangular, Jensen-Shannon, and Quadratic Form. In this case, each element of the space can be associated with two reference objects, and a four-point lower-bound property is used instead of the simple triangle inequality. The probability of exclusion is strictly greater than with simple local pivoting; the space required per object and the calculation are again tiny in relative terms. We show that the resulting mechanism can be very effective. A consequence of using the four-point property is that, for m reference points, there are ((2)(m)) pivot pairs to choose from, giving a very good chance of a good selection being available from a small number of distance calculations. Finding the best pair has a quadratic cost with the number of references; however, we provide experimental evidence that good heuristics exist. Finally, we show how the resulting mechanism can be integrated with a more scalable technique to provide a very significant performance improvement, for a very small overhead in build-time and memory cost.
引用
收藏
页码:233 / 246
页数:14
相关论文
共 50 条
  • [1] Query filtering using two-dimensional local embeddings
    Vadicamo, Lucia
    Connor, Richard
    Chavez, Edgar
    INFORMATION SYSTEMS, 2021, 101
  • [2] Gesture Generation with Low-Dimensional Embeddings
    Chiu, Chung-Cheng
    Marsella, Stacy
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 781 - 788
  • [3] Low-Dimensional Embeddings for Interaction Design
    Rusu, Marius Mihai
    Schoett, Svenja Yvonne
    Williamson, John H.
    Schmidt, Albrecht
    Murray-Smith, Roderick
    ADVANCED INTELLIGENT SYSTEMS, 2022, 4 (02)
  • [4] Very low-dimensional latent semantic indexing for local query regions
    (Association for Computational Linguistics (ACL)):
  • [5] Interpretation of Structural Preservation in Low-Dimensional Embeddings
    Ghosh, Aindrila
    Nashaat, Mona
    Miller, James
    Quader, Shaikh
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2227 - 2240
  • [6] Low-Dimensional Genotype Embeddings for Predictive Models
    Sultan, Syed Fahad
    Guo, Xingzhi
    Skiena, Steven
    13TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, BCB 2022, 2022,
  • [7] Continuous Character Control with Low-Dimensional Embeddings
    Levine, Sergey
    Wang, Jack M.
    Haraux, Alexis
    Popovic, Zoran
    Koltun, Vladlen
    ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04):
  • [8] On Low Dimensional Local Embeddings
    Abraham, Ittai
    Bartal, Yair
    Neiman, Ofer
    PROCEEDINGS OF THE TWENTIETH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2009, : 875 - 884
  • [9] Visual Exploration of Relationships and Structure in Low-Dimensional Embeddings
    Eckelt, Klaus
    Hinterreiter, Andreas
    Adelberger, Patrick
    Walchshofer, Conny
    Dhanoa, Vaishali
    Humer, Christina
    Heckmann, Moritz
    Steinparz, Christian
    Streit, Marc
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (07) : 3312 - 3326
  • [10] Low-Dimensional Invariant Embeddings for Universal Geometric Learning
    Dym, Nadav
    Gortler, Steven J.
    FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2024, 25 (2) : 375 - 415