IoT Big Data provenance scheme using blockchain on Hadoop ecosystem

被引:0
作者
Houshyar Honar Pajooh
Mohammed A. Rashid
Fakhrul Alam
Serge Demidenko
机构
[1] Massey University,Department of Mechanical & Electrical Engineering, School of Food and Advanced Technology
[2] Sunway University,School of Science and Technology
来源
Journal of Big Data | / 8卷
关键词
Internet of Things; Hyperledger fabric; Blockchain; Big Data; Data provenance; Hadoop; Traceability;
D O I
暂无
中图分类号
学科分类号
摘要
The diversity and sheer increase in the number of connected Internet of Things (IoT) devices have brought significant concerns associated with storing and protecting a large volume of IoT data. Storage volume requirements and computational costs are continuously rising in the conventional cloud-centric IoT structures. Besides, dependencies of the centralized server solution impose significant trust issues and make it vulnerable to security risks. In this paper, a layer-based distributed data storage design and implementation of a blockchain-enabled large-scale IoT system are proposed. It has been developed to mitigate the above-mentioned challenges by using the Hyperledger Fabric (HLF) platform for distributed ledger solutions. The need for a centralized server and a third-party auditor was eliminated by leveraging HLF peers performing transaction verifications and records audits in a big data system with the help of blockchain technology. The HLF blockchain facilitates storing the lightweight verification tags on the blockchain ledger. In contrast, the actual metadata are stored in the off-chain big data system to reduce the communication overheads and enhance data integrity. Additionally, a prototype has been implemented on embedded hardware showing the feasibility of deploying the proposed solution in IoT edge computing and big data ecosystems. Finally, experiments have been conducted to evaluate the performance of the proposed scheme in terms of its throughput, latency, communication, and computation costs. The obtained results have indicated the feasibility of the proposed solution to retrieve and store the provenance of large-scale IoT data within the Big Data ecosystem using the HLF blockchain. The experimental results show the throughput of about 600 transactions, 500 ms average response time, about 2–3% of the CPU consumption at the peer process and approximately 10–20% at the client node. The minimum latency remained below 1 s however, there is an increase in the maximum latency when the sending rate reached around 200 transactions per second (TPS).
引用
收藏
相关论文
共 92 条
[1]  
Gantz J(2011)Extracting value from chaos. IDC iview 1142 1-12
[2]  
Reinsel D(2018)Multimedia big data analytics: a survey ACM Comput Surv (CSUR) 51 1-34
[3]  
Pouyanfar S(2019)Enhanced secured map reduce layer for big data privacy and security J Big Data 6 1-17
[4]  
Yang Y(2018)Big healthcare data: preserving security and privacy J Big Data 5 1-18
[5]  
Chen S-C(2021)Designing a permissioned blockchain network for the halal industry using hyperledger fabric with multiple channels and the raft consensus mechanism J Big Data 8 1-16
[6]  
Shyu M-L(2019)Big data adoption: state of the art and research challenges Inf Process Manag 56 102095-3526
[7]  
Iyengar S(2021)Multi-layer blockchain-based security architecture for internet of things Sensors 21 772-4195
[8]  
Jain P(2021)Hyperledger fabric blockchain for securing the edge internet of things Sensors 21 359-1532
[9]  
Gyanchandani M(2018)Blockchain-enabled data collection and sharing for industrial iot with deep reinforcement learning IEEE Trans Ind Inf 15 3516-1655
[10]  
Khare N(2019)Become: blockchain-enabled computation offloading for iot in mobile edge computing IEEE Trans Ind Inf 16 4187-32