Resource Pricing and Allocation in MEC Enabled Blockchain Systems: An A3C Deep Reinforcement Learning Approach

被引:137
作者
Du, Jianbo [1 ]
Cheng, Wenjie [1 ]
Lu, Guangyue [1 ]
Cao, Haotong [2 ]
Chu, Xiaoli [3 ]
Zhang, Zhicai [4 ]
Wang, Junxuan [1 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Shaanxi Key Lab Informat Commun Network & Secur, Xian 710121, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing 210003, Peoples R China
[3] Univ Sheffield, Dept Elect & Elect Engn, Sheffield S1 3JD, S Yorkshire, England
[4] Shanxi Univ, Sch Phys & Elect Engn, Taiyuan 030006, Peoples R China
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2022年 / 9卷 / 01期
关键词
Wireless communication; Multi-access edge computing; Simulation; Reinforcement learning; Pricing; Blockchains; Resource management; Asynchronous advantage actor-critic (A3C); blockchain; deep reinforcement learning; mobile edge computing; pricing; resource allocation; WIRELESS NETWORKS; JOINT OPTIMIZATION; EDGE; RADIO;
D O I
10.1109/TNSE.2021.3068340
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
When using blockchain in mobile systems, computation intensive mining tasks pose great challenges to the processing capabilities of mobile miner equipment. Mobile edge computing (MEC) is an effective solution to alleviating the problem via task offloading. In the mining process, miners compete for rewards through puzzle solving, where only the miner that first completes the process will be rewarded. Thus, miners may wish to pay higher price and use more communication resources in task offloading and more computation resources in task processing for latency reduction. However, there are risks for the miners not profiting from consuming more resources or paying a higher price, so miners are rational in blockchain systems. In order to maximize the rational total profit of all miners, we use an asynchronous advantage actor-critic (A3C) deep reinforcement learning algorithm to obtain the resource pricing and allocation, considering the stochastic properties of wireless channels, and the prospect theory is employed to strike a good balance between risks and rewards. Numerical results show that our proposed A3C based joint optimization algorithm converges fast and outperforms the baseline algorithms in terms of the total reward.
引用
收藏
页码:33 / 44
页数:12
相关论文
共 41 条
[11]   Cooperative Computation Offloading and Resource Allocation for Blockchain-Enabled Mobile-Edge Computing: A Deep Reinforcement Learning Approach [J].
Feng, Jie ;
Yu, F. Richard ;
Pei, Qingqi ;
Chu, Xiaoli ;
Du, Jianbo ;
Zhu, Li .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) :6214-6228
[12]   Towards random-honest miners selection and multi-blocks creation: Proof-of-negotiation consensus mechanism in blockchain networks [J].
Feng, Jingyu ;
Zhao, Xinyu ;
Chen, Kexuan ;
Zhao, Feng ;
Zhang, Guanghua .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 105 :248-258
[13]   A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J].
Grondman, Ivo ;
Busoniu, Lucian ;
Lopes, Gabriel A. D. ;
Babuska, Robert .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) :1291-1307
[14]   Adaptive Resource Allocation in Future Wireless Networks With Blockchain and Mobile Edge Computing [J].
Guo, Fengxian ;
Yu, F. Richard ;
Zhang, Heli ;
Ji, Hong ;
Liu, Mengting ;
Leung, Victor C. M. .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (03) :1689-1703
[15]   Integrated Networking, Caching, and Computing for Connected Vehicles: A Deep Reinforcement Learning Approach [J].
He, Ying ;
Zhao, Nan ;
Yin, Hongxi .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (01) :44-55
[16]   Software-Defined Networks with Mobile Edge Computing and Caching for Smart Cities: A Big Data Deep Reinforcement Learning Approach [J].
He, Ying ;
Yu, F. Richard ;
Zhao, Nan ;
Leung, Victor C. M. ;
Yin, Hongxi .
IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (12) :31-37
[17]   SURVIVOR: A blockchain based edge-as-a-service framework for secure energy trading in SDN-enabled vehicle-to-grid environment [J].
Jindal, Anish ;
Aujla, Gagangeet Singh ;
Kumar, Neeraj .
COMPUTER NETWORKS, 2019, 153 :36-48
[18]  
Ke Zhang, 2019, 2019 IEEE International Conference on Smart Internet of Things (SmartIoT). Proceedings, P234, DOI 10.1109/SmartIoT.2019.00043
[19]   Hierarchical Edge Caching in Device-to-Device Aided Mobile Networks: Modeling, Optimization, and Design [J].
Li, Xiuhua ;
Wang, Xiaofei ;
Wan, Peng-Jun ;
Han, Zhu ;
Leung, Victor C. M. .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (08) :1768-1785
[20]   Blockchain-Enabled Secure Data Sharing Scheme in Mobile-Edge Computing: An Asynchronous Advantage Actor-Critic Learning Approach [J].
Liu, Lei ;
Feng, Jie ;
Pei, Qingqi ;
Chen, Chen ;
Ming, Yang ;
Shang, Bodong ;
Dong, Mianxiong .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) :2342-2353