Re-identification and information fusion between anonymized CDR and social network data

被引:30
|
作者
Cecaj, Alket [1 ]
Mamei, Marco [1 ]
Zambonelli, Franco [1 ]
机构
[1] Univ Modena & Reggio Emilia, Reggio Emilia, Italy
关键词
Mobility patterns; De-anonymization; Information fusion;
D O I
10.1007/s12652-015-0303-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The analysis of multiple datasets on users' behaviors opens interesting information fusion possibilities and, at the same time, creates a potential for re-identification and de-anonymization of users' data. On the one hand, this kind of approaches can breach users' privacy despite anonymization. On the other hand, combining different datasets is a key enabler for advanced context-awareness in that information from multiple sources can complement and enrich each other. In this work we analyze different anonymized mobility datasets in the direction of highlighting re-identification and information fusion possibilities. In particular we focus on call detail record (CDR) datasets released by mobile telecom operators and datasets comprising geo-localized messages released by social network sites. Results shows that: (1) in line with previous findings, few (about 4) data points are enough to uniquely pin point the majority (90 %) of the users, (2) more than 20 % of CDR users have a single social network user exhibiting a number of matching data points. We speculate that these two users might be the same person. (3) We derive an estimate of the probability of two users begin the same person given the number of data points they have in common, and estimate that for 3 % of the social network users we can find a CDR user very likely (>90 % probability) to be the same person.
引用
收藏
页码:83 / 96
页数:14
相关论文
共 13 条
  • [1] Re-identification and information fusion between anonymized CDR and social network data
    Alket Cecaj
    Marco Mamei
    Franco Zambonelli
    Journal of Ambient Intelligence and Humanized Computing, 2016, 7 : 83 - 96
  • [2] Unsupervised Person Re-Identification Method Based on Multi-Granularity Information Fusion
    Wen, Jing
    Zhang, Fukang
    Computer Engineering and Applications, 2023, 59 (13) : 99 - 109
  • [3] Information fusion from multiple cameras for gait-based re-identification and recognition
    Chattopadhyay, Pratik
    Sural, Shamik
    Mukherjee, Jayanta
    IET IMAGE PROCESSING, 2015, 9 (11) : 969 - 976
  • [4] Person re-identification by order-induced metric fusion
    Mirmahboub, Behzad
    Mekhalfi, Mohamed Lamine
    Murino, Vittorio
    NEUROCOMPUTING, 2018, 275 : 667 - 676
  • [5] Complex networks and social network analysis in information fusion
    Svenson, Pontus
    2006 9TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOLS 1-4, 2006, : 1869 - 1875
  • [6] Rapid Re-Identification Risk Assessment for Anonymous Data Set in Mobile Multimedia Scene
    Yang, Zhigang
    Wang, Ruyan
    Luo, Daizhong
    Xiong, Yu
    IEEE ACCESS, 2020, 8 : 41557 - 41565
  • [7] Finding Overlapping Communities Based on Information Fusion in Social Network
    Jiang, Lina
    Li, Hong
    Wang, Lidong
    Wu, Junjie
    2017 14TH INTERNATIONAL CONFERENCE ON SERVICES SYSTEMS AND SERVICES MANAGEMENT (ICSSSM), 2017,
  • [8] Information fusion oriented heterogeneous social network for friend recommendation via community detection
    Huang, Mingqing
    Jiang, Qingshan
    Qu, Qiang
    Chen, Lifei
    Chen, Hui
    APPLIED SOFT COMPUTING, 2022, 114
  • [9] Visualizing Situational Data: Applying Information Fusion for Detecting Social-Ecological Events
    Altaweel, Mark R.
    Alessa, Lillian N.
    Kliskey, Andrew D.
    SOCIAL SCIENCE COMPUTER REVIEW, 2010, 28 (04) : 497 - 514
  • [10] Semantic Information Fusion of Linked Open Data and Social Big Data for the Creation of an Extended Corporate CRM Database
    Torre-Bastida, Ana I.
    Villar-Rodriguez, Esther
    Del Ser, Javier
    Gil-Lopez, Sergio
    INTELLIGENT DISTRIBUTED COMPUTING VIII, 2015, 570 : 211 - 221