Sampled Angles in High-Dimensional Spaces

被引:4
|
作者
Connor, Richard [1 ]
Dearle, Alan [1 ]
机构
[1] Univ St Andrews, Jack Cole Bldg,North Haugh, St Andrews KY16 9SX, Fife, Scotland
来源
SIMILARITY SEARCH AND APPLICATIONS, SISAP 2020 | 2020年 / 12440卷
关键词
Metric search; High dimensional space;
D O I
10.1007/978-3-030-60936-8_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Similarity search using metric indexing techniques is largely a solved problem in low-dimensional spaces. However techniques based only on the triangle inequality property start to fail as dimensionality increases. Since proper metric spaces allow a finite projection of any three objects into a 2D Euclidean space, the notion of angle can be validly applied among any three (but no more) objects. High dimensionality is known to have interesting effects on angles in vector spaces, but to our knowledge this has not been studied in more general metric spaces. Here, we consider the use of angles among objects in combination with distances. As dimensionality becomes higher, we show that the variance in sampled angles reduces. Furthermore, sampled angles also become correlated with inter-object distances, giving different distributions between query solutions and non-solutions. We show the theoretical underpinnings of this observation in unbounded high-dimensional Euclidean spaces, and then examine how the pure property is reflected in some real-world high dimensional spaces. Our experiments on both generated and real world datasets demonstrate that these observations can have an important impact on the tractability of search as dimensionality increases.
引用
收藏
页码:233 / 247
页数:15
相关论文
共 50 条
  • [1] EM in high-dimensional spaces
    Draper, BA
    Elliott, DL
    Hayes, J
    Baek, K
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (03): : 571 - 577
  • [2] The mathematics of high-dimensional spaces
    Rogers, D
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1998, 215 : U524 - U524
  • [3] Containment problems in high-dimensional spaces
    Ishigami, Y
    GRAPHS AND COMBINATORICS, 1995, 11 (04) : 327 - 335
  • [4] Torelli spaces of high-dimensional manifolds
    Ebert, Johannes
    Randal-Williams, Oscar
    JOURNAL OF TOPOLOGY, 2015, 8 (01) : 38 - 64
  • [5] Clustering in high-dimensional data spaces
    Murtagh, FD
    STATISTICAL CHALLENGES IN ASTRONOMY, 2003, : 279 - 292
  • [6] Latent Spaces: The High-Dimensional Infosphere
    Lintunen, Erik
    HALFWAY TO THE FUTURE SYMPOSIUM (HTTF 2019), 2019,
  • [7] Packing hyperspheres in high-dimensional Euclidean spaces
    Skoge, Monica
    Donev, Aleksandar
    Stillinger, Frank H.
    Torquato, Salvatore
    PHYSICAL REVIEW E, 2006, 74 (04)
  • [8] Nearest Neighbor Search in High-Dimensional Spaces
    Andoni, Alexandr
    MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2011, 2011, 6907 : 1 - 1
  • [9] Stable moduli spaces of high-dimensional handlebodies
    Botvinnik, Boris
    Perlmutter, Nathan
    JOURNAL OF TOPOLOGY, 2017, 10 (01) : 101 - 163
  • [10] Hiding outliers in high-dimensional data spaces
    Steinbuss G.
    Böhm K.
    International Journal of Data Science and Analytics, 2017, 4 (3) : 173 - 189