Estimating Pairwise Distances in Large Graphs

被引:0
作者
Christoforaki, Maria [1 ]
Suel, Torsten [1 ]
机构
[1] NYU, Polytech Sch Engn, Brooklyn, NY 11201 USA
来源
2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2014年
基金
美国国家科学基金会;
关键词
FINITE METRIC-SPACES; ALGORITHM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Point-to-point distance estimation in large scale graphs is a fundamental and well studied problem with applications in many areas such as Social Search. Previous work has focused on selecting an appropriate subset of vertices as landmarks, aiming to derive distance upper or lower bounds that are as tight as possible. In order to compute a distance bound between two vertices, the proposed methods apply triangle inequalities on top of the precomputed distances between each of these vertices and the landmarks, and then use the tightest one. In this work we take a fresh look at this setting and approach it as a learning problem. As features, we use structural attributes of the vertices involved as well as the bounds described above, and we learn a function that predicts the distance between a source and a destination vertex. We conduct an extensive experimental evaluation on a variety of real-world graphs and show that the average relative prediction error of our proposed methods significantly outperforms state-of-the-art landmark-based estimates. Our method is particularily efficient when the available space is very limited.
引用
收藏
页码:335 / 344
页数:10
相关论文
共 36 条
[21]  
Goldberg AV, 2007, LECT NOTES COMPUT SC, V4362, P88
[22]  
Goldberg AV, 2005, PROCEEDINGS OF THE SIXTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P156
[23]  
Jin R., 2012, SIGMOD, P445, DOI DOI 10.1145/2213836.2213887
[24]  
Jin R., 2008, P ACM SIGMOD INT C M, P595, DOI DOI 10.1145/1376616.1376677
[25]  
Jin RM, 2009, ACM SIGMOD/PODS 2009 CONFERENCE, P813
[26]   On the distortion required for embedding finite metric spaces into normed spaces [J].
Matousek, J .
ISRAEL JOURNAL OF MATHEMATICS, 1996, 93 :333-344
[27]  
Miao Qiao, 2011, Scientific and Statistical Database Management. Proceedings 23rd International Conference, SSDBM 2011, P255, DOI 10.1007/978-3-642-22351-8_16
[28]  
Potamias M., 2009, P 18 ACM C INF KNOWL, P867, DOI DOI 10.1145/1645953.1646063
[29]  
Qiao M., 2013, PVLDB, V6
[30]   Approximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme [J].
Qiao, Miao ;
Cheng, Hong ;
Chang, Lijun ;
Yu, Jeffrey Xu .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) :55-68