Indexing Earth Mover's Distance over Network Metrics

被引:3
|
作者
Wang, Ting [1 ]
Meng, Shicong [2 ]
Bian, Jiang [3 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Facebook Inc, Menlo Pk, CA 94025 USA
[3] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Earth mover's distance; network metrics; similarity search; delimit and filter;
D O I
10.1109/TKDE.2014.2373359
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Earth Mover's Distance (EMD) is a well-known distance metric for data represented as probability distributions over a predefined feature space. Supporting EMD-based similarity search has attracted intensive research effort. Despite the plethora of literature, most existing solutions are optimized for L-p feature spaces (e.g., Euclidean space); while in a spectrum of applications, the relationships between features are better captured using networks. In this paper, we study the problem of answering k-nearest neighbor (k-NN) queries under network-based EMD metrics (NEMD). We propose OASIS, a new access method which leverages the network structure of feature space and enables efficient NEMD-based similarity search. Specifically, OASIS employs three novel techniques: (i) Range Oracle, a scalable model to estimate the range of k-th nearest neighbor under NEMD, (ii) Boundary Index, a structure that efficiently fetches candidates within given range, and (iii) Network Compression Hierarchy, an incremental filtering mechanism that effectively prunes false positive candidates to save unnecessary computation. Through extensive experiments using both synthetic and real data sets, we confirmed that OASIS significantly outperforms the state-of-the-art methods in query processing cost.
引用
收藏
页码:1588 / 1601
页数:14
相关论文
共 50 条
  • [31] Hyperspectral Image Few-Shot Classification Network Based on the Earth Mover's Distance
    Sun, Jiaxing
    Shen, Xiaobo
    Sun, Quansen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [32] Distance metrics for data interpolation over large areas on Earth's surface
    Krivoruchko, Konstantin
    Gribov, Alexander
    SPATIAL STATISTICS, 2020, 35
  • [33] Using Earth Mover's Distance for audio clip retrieval
    Peng, Yuxin
    Fang, Cuihua
    Chen, Xiaoou
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2006, PROCEEDINGS, 2006, 4261 : 405 - +
  • [34] An Earth Mover's Distance Based Graph Distance Metric For Financial Statements
    Noels, Sander
    Vandermarlieret, Benjamin
    Bastiaensent, Ken
    De Bie, Tijl
    2022 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING AND ECONOMICS (CIFER), 2022,
  • [35] The Earth Mover's Distance as a Metric for the Space of Inorganic Compositions
    Hargreaves, Cameron J.
    Dyer, Matthew S.
    Gaultois, Michael W.
    Kurlin, Vitaliy A.
    Rosseinsky, Matthew J.
    CHEMISTRY OF MATERIALS, 2020, 32 (24) : 10610 - 10620
  • [36] Matching point sets with respect to the Earth Mover's Distance
    Cabello, Sergio
    Giannopoulos, Panos
    Knauer, Christian
    Rote, Gunter
    COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2008, 39 (02): : 118 - 133
  • [37] Extending Earth Mover's Distance to Occluded Face Verification
    Vidal, Pedro
    Chu, Henry
    Biesseck, Bernardo
    Granada, Roger
    Fuhr, Gustavo
    Menotti, David
    2023 36TH CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES, SIBGRAPI 2023, 2023, : 49 - 54
  • [38] Accurate Approximation of the Earth Mover's Distance in Linear Time
    Min-Hee Jang
    Sang-Wook Kim
    Christos Faloutsos
    Sunju Park
    Journal of Computer Science & Technology, 2014, (01) : 142 - 154
  • [39] METRIC-PRESERVING REDUCTION OF EARTH MOVER'S DISTANCE
    Takano, Yuichi
    Yamamoto, Yoshitsugu
    ASIA-PACIFIC JOURNAL OF OPERATIONAL RESEARCH, 2010, 27 (01) : 39 - 54
  • [40] Learning quantum data with the quantum earth mover's distance
    Kiani, Bobak Toussi
    De Palma, Giacomo
    Marvian, Milad
    Liu, Zi-Wen
    Lloyd, Seth
    QUANTUM SCIENCE AND TECHNOLOGY, 2022, 7 (04)