Resource Pricing and Allocation in MEC Enabled Blockchain Systems: An A3C Deep Reinforcement Learning Approach

被引:137
作者
Du, Jianbo [1 ]
Cheng, Wenjie [1 ]
Lu, Guangyue [1 ]
Cao, Haotong [2 ]
Chu, Xiaoli [3 ]
Zhang, Zhicai [4 ]
Wang, Junxuan [1 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Shaanxi Key Lab Informat Commun Network & Secur, Xian 710121, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing 210003, Peoples R China
[3] Univ Sheffield, Dept Elect & Elect Engn, Sheffield S1 3JD, S Yorkshire, England
[4] Shanxi Univ, Sch Phys & Elect Engn, Taiyuan 030006, Peoples R China
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2022年 / 9卷 / 01期
关键词
Wireless communication; Multi-access edge computing; Simulation; Reinforcement learning; Pricing; Blockchains; Resource management; Asynchronous advantage actor-critic (A3C); blockchain; deep reinforcement learning; mobile edge computing; pricing; resource allocation; WIRELESS NETWORKS; JOINT OPTIMIZATION; EDGE; RADIO;
D O I
10.1109/TNSE.2021.3068340
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
When using blockchain in mobile systems, computation intensive mining tasks pose great challenges to the processing capabilities of mobile miner equipment. Mobile edge computing (MEC) is an effective solution to alleviating the problem via task offloading. In the mining process, miners compete for rewards through puzzle solving, where only the miner that first completes the process will be rewarded. Thus, miners may wish to pay higher price and use more communication resources in task offloading and more computation resources in task processing for latency reduction. However, there are risks for the miners not profiting from consuming more resources or paying a higher price, so miners are rational in blockchain systems. In order to maximize the rational total profit of all miners, we use an asynchronous advantage actor-critic (A3C) deep reinforcement learning algorithm to obtain the resource pricing and allocation, considering the stochastic properties of wireless channels, and the prospect theory is employed to strike a good balance between risks and rewards. Numerical results show that our proposed A3C based joint optimization algorithm converges fast and outperforms the baseline algorithms in terms of the total reward.
引用
收藏
页码:33 / 44
页数:12
相关论文
共 41 条
[1]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[2]   A Decoupled Blockchain Approach for Edge-Envisioned IoT-Based Healthcare Monitoring [J].
Aujla, Gagangeet Singh ;
Jindal, Anish .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (02) :491-499
[3]   Dynamic Embedding and Quality of Service-Driven Adjustment for Cloud Networks [J].
Cao, Haotong ;
Wu, Shengchen ;
Aujla, Gagangeet Singh ;
Wang, Qin ;
Yang, Longxiang ;
Zhu, Hongbo .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (02) :1406-1416
[4]   A Joint Learning and Communications Framework for Federated Learning Over Wireless Networks [J].
Chen, Mingzhe ;
Yang, Zhaohui ;
Saad, Walid ;
Yin, Changchuan ;
Poor, H. Vincent ;
Cui, Shuguang .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (01) :269-283
[5]   Federated Echo State Learning for Minimizing Breaks in Presence in Wireless Virtual Reality Networks [J].
Chen, Mingzhe ;
Semiari, Omid ;
Saad, Walid ;
Liu, Xuanlin ;
Yin, Changchuan .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (01) :177-191
[6]   Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial [J].
Chen, Mingzhe ;
Challita, Ursula ;
Saad, Walid ;
Yin, Changchuan ;
Debbah, Merouane .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2019, 21 (04) :3039-3071
[7]  
Du J., IEEE INTERNET THINGS, P2021
[8]   MEC-Assisted Immersive VR Video Streaming Over Terahertz Wireless Networks: A Deep Reinforcement Learning Approach [J].
Du, Jianbo ;
Yu, F. Richard ;
Lu, Guangyue ;
Wang, Junxuan ;
Jiang, Jing ;
Chu, Xiaoli .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10) :9517-9529
[9]   Computation Offloading and Resource Allocation in Vehicular Networks Based on Dual-Side Cost Minimization [J].
Du, Jianbo ;
Yu, F. Richard ;
Chu, Xiaoli ;
Feng, Jie ;
Lu, Guangyue .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) :1079-1092
[10]   Joint Optimization of Radio and Computational Resources Allocation in Blockchain-Enabled Mobile Edge Computing Systems [J].
Feng, Jie ;
Yu, F. Richard ;
Pei, Qingqi ;
Du, Jianbo ;
Zhu, Li .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (06) :4321-4334