Web browsing privacy in the deep learning era: Beyond VPNs and encryption

被引:4
作者
Perdices, Daniel [1 ]
de Vergara, Jorge E. Lopez [1 ,2 ]
Gonzalez, Ivan [1 ,2 ]
de Pedro, Luis [1 ,2 ]
机构
[1] Univ Autonoma Madrid, Escuela Politecn Super, Dept Elect & Commun Technol, Madrid, Spain
[2] Naudit High Performance Comp & Networking SL, Madrid, Spain
关键词
Web browsing analytics; Neural network; Privacy; Deep learning; Transformer; MOBILE APP IDENTIFICATION; INTERNET;
D O I
10.1016/j.comnet.2022.109471
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Web browsing privacy is a matter of paramount importance for the Internet users. While they try to protect themselves from being monitored by getting advantage of encryption or VPNs, users' privacy is still unaccomplished, even taking into account the tangled web, with several domains visited at the same time in a single web page, or IP addresses of a cloud provider shared by several sites. In this work, we provide a novel approach to identify user web browsing that only takes into account the IP addresses that the user has connected to and without performing any DNS reverse resolutions. We use this sequence of addresses as an input of different state-of-the-art deep learning models, such as multi-layer perceptron and transformers, which are able to accurately identify which was the website actually visited among Alexa's World Top 500 most visited domains. Moreover, we have also studied other factors, such as the dependence on the DNS server used to resolve the visited IP addresses, the accuracy for the top domains (e.g., Google, YouTube, Facebook, etc.), data augmentation by packet sampling simulation to improve our results, the impact on packet sampling and the fine-tuning and possible impact of model parameters or the scalability of our approach. We conclude that, using only a 10% of the packets, we can identify the visited website with an accuracy and F1 score between 94% and 95%.
引用
收藏
页数:16
相关论文
共 50 条
[41]   A review of privacy-preserving techniques for deep learning [J].
Boulemtafes, Amine ;
Derhab, Abdelouahid ;
Challal, Yacine .
NEUROCOMPUTING, 2020, 384 :21-45
[42]   The road to privacy in IoT: beyond encryption and signatures, towards unobservable communication [J].
Staudemeyer, Ralf C. ;
Poehls, Henrich C. ;
Wojcik, Marcin .
2018 IEEE 19TH INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM), 2018,
[43]   Protecting User Privacy: An Approach for Untraceable Web Browsing History and Unambiguous User Profiles [J].
Beigi, Ghazaleh ;
Guo, Ruocheng ;
Nou, Alexander ;
Zhang, Yanchao ;
Liu, Huan .
PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, :213-221
[44]   Deep Learning-Based Privacy Preservation and Data Analytics for IoT Enabled Healthcare [J].
Bi, Hongliang ;
Liu, Jiajia ;
Kato, Nei .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (07) :4798-4807
[45]   Beyond Moral Coupling: Analysing Politics of Privacy in the Era of Surveillance [J].
Heikkila, Heikki .
MEDIA AND COMMUNICATION, 2020, 8 (02) :248-257
[46]   Privacy-preserving multi-party deep learning based on homomorphic proxy re-encryption [J].
Shen, Xiaoying ;
Luo, Xue ;
Yuan, Feng ;
Wang, Baocang ;
Chen, Yange ;
Tang, Dianhua ;
Gao, Le .
JOURNAL OF SYSTEMS ARCHITECTURE, 2023, 144
[47]   Wavelets in the Deep Learning Era [J].
Ramzi, Zaccharie ;
Michalewicz, Kevin ;
Starck, Jean-Luc ;
Moreau, Thomas ;
Ciuciu, Philippe .
JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2023, 65 (01) :240-251
[48]   WAVELETS IN THE DEEP LEARNING ERA [J].
Ramzi, Zaccharie ;
Starck, Jean-Luc ;
Moreau, Thomas ;
Ciuciu, Philippe .
28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, :1417-1421
[49]   Wavelets in the Deep Learning Era [J].
Zaccharie Ramzi ;
Kevin Michalewicz ;
Jean-Luc Starck ;
Thomas Moreau ;
Philippe Ciuciu .
Journal of Mathematical Imaging and Vision, 2023, 65 :240-251
[50]   GeFL: Gradient Encryption-Aided Privacy Preserved Federated Learning for Autonomous Vehicles [J].
Parekh, Raj ;
Patel, Nisarg ;
Gupta, Rajesh ;
Jadav, Nilesh Kumar ;
Tanwar, Sudeep ;
Alharbi, Abdullah ;
Tolba, Amr ;
Neagu, Bogdan-Constantin ;
Raboaca, Maria Simona .
IEEE ACCESS, 2023, 11 :1825-1839