Indexing Earth Mover's Distance over Network Metrics

被引:3
|
作者
Wang, Ting [1 ]
Meng, Shicong [2 ]
Bian, Jiang [3 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Facebook Inc, Menlo Pk, CA 94025 USA
[3] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Earth mover's distance; network metrics; similarity search; delimit and filter;
D O I
10.1109/TKDE.2014.2373359
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Earth Mover's Distance (EMD) is a well-known distance metric for data represented as probability distributions over a predefined feature space. Supporting EMD-based similarity search has attracted intensive research effort. Despite the plethora of literature, most existing solutions are optimized for L-p feature spaces (e.g., Euclidean space); while in a spectrum of applications, the relationships between features are better captured using networks. In this paper, we study the problem of answering k-nearest neighbor (k-NN) queries under network-based EMD metrics (NEMD). We propose OASIS, a new access method which leverages the network structure of feature space and enables efficient NEMD-based similarity search. Specifically, OASIS employs three novel techniques: (i) Range Oracle, a scalable model to estimate the range of k-th nearest neighbor under NEMD, (ii) Boundary Index, a structure that efficiently fetches candidates within given range, and (iii) Network Compression Hierarchy, an incremental filtering mechanism that effectively prunes false positive candidates to save unnecessary computation. Through extensive experiments using both synthetic and real data sets, we confirmed that OASIS significantly outperforms the state-of-the-art methods in query processing cost.
引用
收藏
页码:1588 / 1601
页数:14
相关论文
共 50 条
  • [41] The earth mover's distance is the mallows distance: Some insights from statistics
    Levina, E
    Bickel, P
    EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, 2001, : 251 - 256
  • [42] A Linear Approximate Algorithm for Earth Mover's Distance with Thresholded Ground Distance
    Li, Longjie
    Ma, Min
    Lei, Peng
    Wang, Xiaoping
    Chen, Xiaoyun
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [43] Localized Earth Mover's Distance for Robust Histogram Comparison
    Won, Kwang Hee
    Jung, Soon Ki
    COMPUTER VISION-ACCV 2010, PT I, 2011, 6492 : 478 - 489
  • [44] Matching point sets with respect to the earth mover's distance
    Cabello, S
    Giannopoulos, P
    Knauer, C
    Rote, G
    ALGORITHMS - ESA 2005, 2005, 3669 : 520 - 531
  • [45] Using earth mover’s distance for viral outbreak investigations
    Andrew Melnyk
    Sergey Knyazev
    Fredrik Vannberg
    Leonid Bunimovich
    Pavel Skums
    Alex Zelikovsky
    BMC Genomics, 21
  • [46] Nonnegative Matrix Factorization with Earth Mover's Distance Metric
    Sandler, Roman
    Lindenbaum, Michael
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 1873 - 1880
  • [47] Wavelet and Earth Mover's Distance Coupling Denoising Techniques
    Zhang, Zhihua
    Xu, Xudong
    Crabbe, M. James C.
    ELECTRONICS, 2023, 12 (17)
  • [48] Data-Dependent LSH for the Earth Mover's Distance
    Jayaram, Rajesh
    Waingarten, Erik
    Zhang, Tian
    PROCEEDINGS OF THE 56TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2024, 2024, : 800 - 811
  • [49] Earth Mover's Distance as a Comparison Metric for Analog Behavior
    Rath, Alexander W.
    Simon, Sebastian
    Esen, Volkan
    Ecker, Wolfgang
    VLSI-SOC: SYSTEM-ON-CHIP IN THE NANOSCALE ERA - DESIGN, VERIFICATION AND RELIABILITY, 2017, 508 : 173 - 191
  • [50] Accurate Approximation of the Earth Mover's Distance in Linear Time
    Jang, Min-Hee
    Kim, Sang-Wook
    Faloutsos, Christos
    Park, Sunju
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2014, 29 (01) : 142 - 154