Privacy-Preserving Probabilistic Data Encoding for IoT Data Analysis

被引:1
作者
Zaman, Zakia [1 ,2 ]
Xue, Wanli [1 ,2 ]
Gauravaram, Praveen [3 ]
Hu, Wen [1 ,2 ]
Jiang, Jiaojiao [1 ,2 ]
Jha, Sanjay K. [1 ,2 ]
机构
[1] Univ New South Wales UNSW, Inst Cybersecur IFCYBER, Sydney, NSW 2052, Australia
[2] Cyber Secur Cooperat Res Ctr, Joondalup, WA 6027, Australia
[3] Tata Consultancy Serv Ltd, Brisbane, Qld 2060, Australia
关键词
Data privacy; Encoding; Differential privacy; Data models; Privacy; Data analysis; Cloud computing; Data encoding; bloom filter; privacy-preserving machine learning; differential privacy; privacy; utility;
D O I
10.1109/TIFS.2024.3468150
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The widespread integration of the Internet of Things (IoT) is crucial in advancing sustainable development. IoT service providers actively collect user data for analysis using sophisticated Deep Learning (DL) algorithms. This enables the extraction of valuable insights for business intelligence and improving service quality. However, as these datasets contain sensitive personal information, there is a risk of privacy breaches when DL models are employed. This vulnerability may result in Membership Inference Attacks (MIA), potentially leading to the unauthorized disclosure of highly sensitive data. Therefore, developing an efficient and privacy-preserving data analysis system for IoT is imperative. Recent research has highlighted the effectiveness of utilizing Bloom Filter (BF)-encoding in conjunction with Differential Privacy (DP) for safeguarding privacy during data analysis. Given its attributes of low complexity and high utility, this approach proves effective, particularly in resource-constrained IoT domains. With this in mind, we propose a novel framework for privacy-preserving IoT data analysis based on BF-encoded data. Our research introduces an innovative BF-encoding technique combined with Local Differential Privacy (LDP), capable of efficiently encoding various types of IoT data (such as facial images and smart-meter data) while maintaining privacy when integrated into DL algorithms for downstream analysis. Experimental results demonstrate that our BF-encoded data surpasses the utility of standard BF-encoded data when utilized in DL algorithms for downstream tasks, showcasing an approximate 30% improvement in classification accuracy. Furthermore, we assess the privacy of these DL models against MIA, revealing that attackers can only make random guesses with an accuracy of approximately 50%.
引用
收藏
页码:9173 / 9187
页数:15
相关论文
共 60 条
[31]  
Papernot N, 2021, AAAI CONF ARTIF INTE, V35, P9312
[32]  
Paverd A., 2014, Technical Report
[33]  
Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
[34]  
Ponnusamy A., 2019, Gender-Detection-Keras
[35]  
Ponnusamy Arun, 2018, CVLIB-High Level Computer Vision Library for Python
[36]   XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks [J].
Rastegari, Mohammad ;
Ordonez, Vicente ;
Redmon, Joseph ;
Farhadi, Ali .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :525-542
[37]  
Sarker K, 2020, Arxiv, DOI arXiv:2009.12027
[38]  
Schnell R, 2016, INT CONF DAT MIN WOR, P218, DOI [10.1109/ICDMW.2016.29, 10.1109/ICDMW.2016.0038]
[39]   Privacy-preserving record linkage using Bloom filters [J].
Schnell, Rainer ;
Bachteler, Tobias ;
Reiher, Joerg .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2009, 9
[40]   Toward Practical Privacy-Preserving Analytics for IoT and Cloud-Based Healthcare Systems [J].
Sharma, Sagar ;
Chen, Keke ;
Sheth, Amit .
IEEE INTERNET COMPUTING, 2018, 22 (02) :42-51