Web browsing privacy in the deep learning era: Beyond VPNs and encryption

Cited by: 2
Authors
Perdices, Daniel [1]
de Vergara, Jorge E. Lopez [1, 2]
Gonzalez, Ivan [1, 2]
de Pedro, Luis [1, 2]
Affiliations
[1] Univ Autonoma Madrid, Escuela Politecn Super, Dept Elect & Commun Technol, Madrid, Spain
[2] Naudit High Performance Comp & Networking SL, Madrid, Spain
Keywords
Web browsing analytics; Neural network; Privacy; Deep learning; Transformer; MOBILE APP IDENTIFICATION; INTERNET;
DOI
10.1016/j.comnet.2022.109471
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Web browsing privacy is a matter of paramount importance for Internet users. Although they try to protect themselves from monitoring by taking advantage of encryption or VPNs, their privacy is still not guaranteed, even taking into account the tangled web, in which a single web page loads several domains at the same time and the IP addresses of a cloud provider are shared by several sites. In this work, we provide a novel approach to identifying user web browsing that takes into account only the IP addresses that the user has connected to, without performing any reverse DNS resolutions. We use this sequence of addresses as the input to different state-of-the-art deep learning models, such as multi-layer perceptrons and transformers, which are able to accurately identify which website was actually visited among Alexa's World Top 500 most visited domains. Moreover, we have also studied other factors, such as the dependence on the DNS server used to resolve the visited IP addresses, the accuracy for the top domains (e.g., Google, YouTube, Facebook), data augmentation by packet sampling simulation to improve our results, the impact of packet sampling, the fine-tuning and possible impact of model parameters, and the scalability of our approach. We conclude that, using only 10% of the packets, we can identify the visited website with an accuracy and F1 score between 94% and 95%.
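This record does not describe the paper's actual preprocessing, so the following is only a minimal sketch of the two ideas named in the abstract: simulating packet sampling to augment the data, and turning the contacted IP addresses into a fixed-length model input. Every function name, the independent per-packet sampling scheme, the byte-wise IPv4 encoding, and the zero padding are assumptions made for illustration, not the authors' method.

```python
import ipaddress
import random


def sample_packets(packet_dst_ips, rate, rng):
    """Keep each packet independently with probability `rate` (assumed sampling model)."""
    return [ip for ip in packet_dst_ips if rng.random() < rate]


def ip_sequence(packet_dst_ips):
    """Collapse a packet trace into the ordered sequence of distinct IPs contacted."""
    seen = set()
    seq = []
    for ip in packet_dst_ips:
        if ip not in seen:
            seen.add(ip)
            seq.append(ip)
    return seq


def encode(seq, max_len):
    """Encode each IPv4 address as its four byte values; truncate or zero-pad to max_len rows."""
    rows = [list(ipaddress.IPv4Address(ip).packed) for ip in seq[:max_len]]
    rows += [[0, 0, 0, 0] for _ in range(max_len - len(rows))]
    return rows


def augment(packet_dst_ips, rate, copies, max_len, seed=0):
    """Produce `copies` sampled variants of one page visit, each encoded for a model."""
    rng = random.Random(seed)
    return [
        encode(ip_sequence(sample_packets(packet_dst_ips, rate, rng)), max_len)
        for _ in range(copies)
    ]


if __name__ == "__main__":
    # Hypothetical trace: destination IPs of the packets seen during one page load
    # (addresses taken from the documentation ranges, not from real sites).
    trace = ["192.0.2.1"] * 40 + ["198.51.100.7"] * 25 + ["203.0.113.9"] * 10
    for variant in augment(trace, rate=0.1, copies=3, max_len=4):
        print(variant)
```

Each variant would then be paired with the label of the visited website and fed to a classifier such as a multi-layer perceptron or a transformer. With a low sampling rate, addresses that receive few packets may drop out of a variant entirely, which is the kind of information loss a sampling study would need to measure.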
Pages: 16
Related papers
50 in total
  • [1] Assessing the Limits of Privacy and Data Usage for Web Browsing Analytics
    Perdices, Daniel
    Lopez de Vergara, Jorge E.
    Gonzalez, Ivan
    PROCEEDINGS OF THE 2021 17TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM 2021): SMART MANAGEMENT FOR FUTURE NETWORKS AND SERVICES, 2021, : 173 - 179
  • [2] Privacy-Preserving Deep Learning via Additively Homomorphic Encryption
    Phong, Le Trieu
    Aono, Yoshinori
    Hayashi, Takuya
    Wang, Lihua
    Moriai, Shiho
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (05) : 1333 - 1345
  • [3] Differential privacy in deep learning: Privacy and beyond
    Wang, Yanling
    Wang, Qian
    Zhao, Lingchen
    Wang, Cong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 148 : 408 - 424
  • [4] Using Adversarial Noises to Protect Privacy in Deep Learning Era
    Liu, Bo
    Ding, Ming
    Zhu, Tianqing
    Xiang, Yong
    Zhou, Wanlei
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [5] Adversaries or allies? Privacy and deep learning in big data era
    Liu, Bo
    Ding, Ming
    Zhu, Tianqing
    Xiang, Yong
    Zhou, Wanlei
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (19)
  • [6] A Survey of Deep Learning Architectures for Privacy-Preserving Machine Learning With Fully Homomorphic Encryption
    Podschwadt, Robert
    Takabi, Daniel
    Hu, Peizhao
    Rafiei, Mohammad H. H.
    Cai, Zhipeng
    IEEE ACCESS, 2022, 10 : 117477 - 117500
  • [7] On Fully Homomorphic Encryption for Privacy-Preserving Deep Learning
    Hernandez Marcano, Nestor J.
    Moller, Mads
    Hansen, Soren
    Jacobsen, Rune Hylsberg
    2019 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2019,
  • [8] Distributed additive encryption and quantization for privacy preserving federated deep learning
    Zhu, Hangyu
    Wang, Rui
    Jin, Yaochu
    Liang, Kaitai
    Ning, Jianting
    NEUROCOMPUTING, 2021, 463 : 309 - 327
  • [9] Privacy-Preserving Image Captioning with Partial Encryption and Deep Learning
    Martin, Antoinette Deborah
    Moon, Inkyu
    MATHEMATICS, 2025, 13 (04)
  • [10] Deep Learning: Differential Privacy Preservation in the Era of Big Data
    Vasa, Jalpesh
    Thakkar, Amit
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2023, 63 (03) : 608 - 631