IoT Big Data provenance scheme using blockchain on Hadoop ecosystem

被引:28
作者
Pajooh, Houshyar Honar [1 ]
Rashid, Mohammed A. [1 ]
Alam, Fakhrul [1 ,2 ]
Demidenko, Serge [1 ,2 ]
机构
[1] Massey Univ, Sch Food & Adv Technol, Dept Mech & Elect Engn, Auckland 0632, New Zealand
[2] Sunway Univ, Sch Sci & Technol, Sunway 47500, Selangor, Malaysia
关键词
Internet of Things; Hyperledger fabric; Blockchain; Big Data; Data provenance; Hadoop; Traceability; EDGE; FRAMEWORK; STORAGE;
D O I
10.1186/s40537-021-00505-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The diversity and sheer increase in the number of connected Internet of Things (IoT) devices have brought significant concerns associated with storing and protecting a large volume of IoT data. Storage volume requirements and computational costs are continuously rising in the conventional cloud-centric IoT structures. Besides, dependencies of the centralized server solution impose significant trust issues and make it vulnerable to security risks. In this paper, a layer-based distributed data storage design and implementation of a blockchain-enabled large-scale IoT system are proposed. It has been developed to mitigate the above-mentioned challenges by using the Hyperledger Fabric (HLF) platform for distributed ledger solutions. The need for a centralized server and a third-party auditor was eliminated by leveraging HLF peers performing transaction verifications and records audits in a big data system with the help of blockchain technology. The HLF blockchain facilitates storing the lightweight verification tags on the blockchain ledger. In contrast, the actual metadata are stored in the off-chain big data system to reduce the communication overheads and enhance data integrity. Additionally, a prototype has been implemented on embedded hardware showing the feasibility of deploying the proposed solution in IoT edge computing and big data ecosystems. Finally, experiments have been conducted to evaluate the performance of the proposed scheme in terms of its throughput, latency, communication, and computation costs. The obtained results have indicated the feasibility of the proposed solution to retrieve and store the provenance of large-scale IoT data within the Big Data ecosystem using the HLF blockchain. The experimental results show the throughput of about 600 transactions, 500 ms average response time, about 2-3% of the CPU consumption at the peer process and approximately 10-20% at the client node. The minimum latency remained below 1 s however, there is an increase in the maximum latency when the sending rate reached around 200 transactions per second (TPS).
引用
收藏
页数:26
相关论文
共 49 条
[1]   Big healthcare data: preserving security and privacy [J].
Abouelmehdi, Karim ;
Beni-Hessane, Abderrahim ;
Khaloufi, Hayat .
JOURNAL OF BIG DATA, 2018, 5 (01)
[2]   Next Generation 5G Wireless Networks: A Comprehensive Survey [J].
Agiwal, Mamta ;
Roy, Abhishek ;
Saxena, Navrati .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (03) :1617-1655
[3]   Hyperledger Fabric: A Distributed Operating System for Permissioned Blockchains [J].
Androulaki, Elli ;
Barger, Artem ;
Bortnikov, Vita ;
Cachin, Christian ;
Christidis, Konstantinos ;
De Caro, Angelo ;
Enyeart, David ;
Ferris, Christopher ;
Laventman, Gennady ;
Manevich, Yacov ;
Muralidharan, Srinivasan ;
Murthy, Chet ;
Binh Nguyen ;
Sethi, Manish ;
Singh, Gari ;
Smith, Keith ;
Sorniotti, Alessandro ;
Stathakopoulou, Chrysoula ;
Vukolic, Marko ;
Cocco, Sharon Weed ;
Yellick, Jason .
EUROSYS '18: PROCEEDINGS OF THE THIRTEENTH EUROSYS CONFERENCE, 2018,
[4]   Big data adoption: State of the art and research challenges [J].
Baig, Maria Ijaz ;
Shuib, Liyana ;
Yadegaridehkordi, Elaheh .
INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (06)
[5]  
Borthakur D., 2007, HADOOP DISTRIBUTED F
[6]  
Caro Miguel Pincheira, 2018, 2018 IoT Vertical and Topical Summit on Agriculture - Tuscany (IOT Tuscany), DOI 10.1109/IOT-TUSCANY.2018.8373021
[7]  
Cutting MCD, APACHE HADOOP
[8]  
Dedeoglu V., 2020, Advanced Applications of Blockchain Technology, P55, DOI [10.1007/978-981-13-8775-3_3, DOI 10.1007/978-981-13-8775-3_3]
[9]  
Deepa N, ARXIV PREPRINT ARXIV
[10]   Permissioned Blockchain and Edge Computing Empowered Privacy-Preserving Smart Grid Networks [J].
Gai, Keke ;
Wu, Yulu ;
Zhu, Liehuang ;
Xu, Lei ;
Zhang, Yan .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) :7992-8004