Web browsing privacy in the deep learning era: Beyond VPNs and encryption

被引:4
作者
Perdices, Daniel [1 ]
de Vergara, Jorge E. Lopez [1 ,2 ]
Gonzalez, Ivan [1 ,2 ]
de Pedro, Luis [1 ,2 ]
机构
[1] Univ Autonoma Madrid, Escuela Politecn Super, Dept Elect & Commun Technol, Madrid, Spain
[2] Naudit High Performance Comp & Networking SL, Madrid, Spain
关键词
Web browsing analytics; Neural network; Privacy; Deep learning; Transformer; MOBILE APP IDENTIFICATION; INTERNET;
D O I
10.1016/j.comnet.2022.109471
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Web browsing privacy is a matter of paramount importance for the Internet users. While they try to protect themselves from being monitored by getting advantage of encryption or VPNs, users' privacy is still unaccomplished, even taking into account the tangled web, with several domains visited at the same time in a single web page, or IP addresses of a cloud provider shared by several sites. In this work, we provide a novel approach to identify user web browsing that only takes into account the IP addresses that the user has connected to and without performing any DNS reverse resolutions. We use this sequence of addresses as an input of different state-of-the-art deep learning models, such as multi-layer perceptron and transformers, which are able to accurately identify which was the website actually visited among Alexa's World Top 500 most visited domains. Moreover, we have also studied other factors, such as the dependence on the DNS server used to resolve the visited IP addresses, the accuracy for the top domains (e.g., Google, YouTube, Facebook, etc.), data augmentation by packet sampling simulation to improve our results, the impact on packet sampling and the fine-tuning and possible impact of model parameters or the scalability of our approach. We conclude that, using only a 10% of the packets, we can identify the visited website with an accuracy and F1 score between 94% and 95%.
引用
收藏
页数:16
相关论文
共 50 条
[21]   Enhancing Web privacy and anonymity in the digital era [J].
Gritzalis, Stefanos .
Information Management and Computer Security, 2004, 12 (03) :255-288
[22]   Genomics enters the deep learning era [J].
Routhier, Etienne ;
Mozziconacci, Julien .
PEERJ, 2022, 10
[23]   Image denoising in the deep learning era [J].
Izadi, Saeed ;
Sutton, Darren ;
Hamarneh, Ghassan .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (07) :5929-5974
[24]   PADL: Privacy-Aware and Asynchronous Deep Learning for IoT Applications [J].
Liu, Xiaoyuan ;
Li, Hongwei ;
Xu, Guowen ;
Liu, Sen ;
Liu, Zhe ;
Lu, Rongxing .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (08) :6955-6969
[25]   When Deep Learning Meets Differential Privacy: Privacy,Security, and More [J].
Li, Xinyan ;
Chen, Yufei ;
Wang, Cong ;
Shen, Chao .
IEEE NETWORK, 2021, 35 (06) :148-155
[26]   Privacy-Preserving Deep Learning and Inference [J].
Riazi, M. Sadegh ;
Koushanfar, Farinaz .
2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,
[27]   Differential Privacy for Deep and Federated Learning: A Survey [J].
El Ouadrhiri, Ahmed ;
Abdelhadi, Ahmed .
IEEE ACCESS, 2022, 10 :22359-22380
[28]   FEDL: Confidential Deep Learning for Autonomous Driving in VANETs Based on Functional Encryption [J].
Tang, Min ;
Huang, Zengyi ;
Deng, Guoqiang .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (12) :21074-21085
[29]   PReDIHERO - Privacy-Preserving Remote Deep Learning Inference based on Homomorphic Encryption and Reversible Obfuscation for Enhanced Client-side Overhead in Pervasive Health Monitoring [J].
Boulemtafes, Amine ;
Derhab, Abdelouahid ;
Braham, Nassim Ait Ali ;
Challal, Yacine .
2021 IEEE/ACS 18TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2021,
[30]   Towards Privacy in Deep Learning [J].
Kerschbaum, Florian .
2021 THIRD IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS (TPS-ISA 2021), 2021, :279-280