Privacy-Preserving Probabilistic Data Encoding for IoT Data Analysis

被引:1
作者
Zaman, Zakia [1 ,2 ]
Xue, Wanli [1 ,2 ]
Gauravaram, Praveen [3 ]
Hu, Wen [1 ,2 ]
Jiang, Jiaojiao [1 ,2 ]
Jha, Sanjay K. [1 ,2 ]
机构
[1] Univ New South Wales UNSW, Inst Cybersecur IFCYBER, Sydney, NSW 2052, Australia
[2] Cyber Secur Cooperat Res Ctr, Joondalup, WA 6027, Australia
[3] Tata Consultancy Serv Ltd, Brisbane, Qld 2060, Australia
关键词
Data privacy; Encoding; Differential privacy; Data models; Privacy; Data analysis; Cloud computing; Data encoding; bloom filter; privacy-preserving machine learning; differential privacy; privacy; utility;
D O I
10.1109/TIFS.2024.3468150
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The widespread integration of the Internet of Things (IoT) is crucial in advancing sustainable development. IoT service providers actively collect user data for analysis using sophisticated Deep Learning (DL) algorithms. This enables the extraction of valuable insights for business intelligence and improving service quality. However, as these datasets contain sensitive personal information, there is a risk of privacy breaches when DL models are employed. This vulnerability may result in Membership Inference Attacks (MIA), potentially leading to the unauthorized disclosure of highly sensitive data. Therefore, developing an efficient and privacy-preserving data analysis system for IoT is imperative. Recent research has highlighted the effectiveness of utilizing Bloom Filter (BF)-encoding in conjunction with Differential Privacy (DP) for safeguarding privacy during data analysis. Given its attributes of low complexity and high utility, this approach proves effective, particularly in resource-constrained IoT domains. With this in mind, we propose a novel framework for privacy-preserving IoT data analysis based on BF-encoded data. Our research introduces an innovative BF-encoding technique combined with Local Differential Privacy (LDP), capable of efficiently encoding various types of IoT data (such as facial images and smart-meter data) while maintaining privacy when integrated into DL algorithms for downstream analysis. Experimental results demonstrate that our BF-encoded data surpasses the utility of standard BF-encoded data when utilized in DL algorithms for downstream tasks, showcasing an approximate 30% improvement in classification accuracy. Furthermore, we assess the privacy of these DL models against MIA, revealing that attackers can only make random guesses with an accuracy of approximately 50%.
引用
收藏
页码:9173 / 9187
页数:15
相关论文
共 60 条
[1]   Privacy-Preserving Machine Learning: Threats and Solutions [J].
Al-Rubaie, Mohammad ;
Chang, J. Morris .
IEEE SECURITY & PRIVACY, 2019, 17 (02) :49-58
[2]  
[Anonymous], 2003, Internet Math., DOI DOI 10.1080/15427951.2004.10129096
[3]  
[Anonymous], Large-Scale Machine Learning on Heterogeneous Systems-GoogleSearch
[4]  
[Anonymous], 2012, IEEE Signal Process. Mag., DOI DOI 10.1109/MSP.2012.2211477
[5]   Local Differential Privacy for Federated Learning [J].
Arachchige, Pathum Chamikara Mahawaga ;
Liu, Dongxi ;
Camtepe, Seyit ;
Nepal, Surya ;
Grobler, Marthie ;
Bertok, Peter ;
Khalil, Ibrahim .
COMPUTER SECURITY - ESORICS 2022, PT I, 2022, 13554 :195-216
[6]   Local Differential Privacy for Deep Learning [J].
Arachchige, Pathum Chamikara Mahawaga ;
Bertok, Peter ;
Khalil, Ibrahim ;
Liu, Dongxi ;
Camtepe, Seyit ;
Atiquzzaman, Mohammed .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) :5827-5842
[7]   Ciphertext-policy attribute-based encryption [J].
Bethencourt, John ;
Sahai, Amit ;
Waters, Brent .
2007 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, PROCEEDINGS, 2007, :321-+
[8]   SPACE/TIME TRADE/OFFS IN HASH CODING WITH ALLOWABLE ERRORS [J].
BLOOM, BH .
COMMUNICATIONS OF THE ACM, 1970, 13 (07) :422-&
[9]  
Borgs C., 2019, Optimal parameter choice for Bloom filter-based privacypreserving record linkage
[10]  
Canonne CL, 2020, ADV NEUR IN, V33