Spatio-Temporal Linkage over Location-Enhanced Services

Cited by: 15
Authors
Basik, Fuat [1]
Gedik, Bugra [1]
Etemoglu, Cagri [2]
Ferhatosmanoglu, Hakan [1, 3]
Affiliations
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[2] Turk Telekom, TR-4349 Istanbul, Turkey
[3] Univ Warwick, Dept Comp Sci, Coventry, W Midlands, England
Funding
National Science Foundation (USA);
Keywords
ENTITY RESOLUTION;
DOI
10.1109/TMC.2017.2711027
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We are witnessing an enormous growth in the volume of data generated by various online services. An important portion of this data contains geographic references, since many of these services are location-enhanced and thus produce spatio-temporal records of their usage. We postulate that the spatio-temporal usage records belonging to the same real-world entity can be matched across records from different location-enhanced services. Linking spatio-temporal records enables data analysts and service providers to obtain information that they cannot derive by analyzing only one set of usage records. In this paper, we develop a new linkage model that can be used to match entities from two sets of spatio-temporal usage records belonging to two different location-enhanced services. This linkage model is based on the concept of k-l diversity, which we developed to capture both the spatial and temporal aspects of the linkage. To realize this linkage model in practice, we develop a scalable linking algorithm called ST-Link, which uses effective spatial and temporal filtering mechanisms that significantly reduce the search space for matching users. Furthermore, ST-Link uses sequential scan procedures to avoid random disk access and thus scales to large datasets. We evaluated our work with respect to accuracy and performance using several datasets. Experiments show that ST-Link is effective in practice for performing spatio-temporal linkage and can scale to large datasets.
Pages: 447-460
Page count: 14