A novel deep policy gradient action quantization for trusted collaborative computation in intelligent vehicle networks

被引:17
作者
Chen, Miaojiang [1 ]
Yi, Meng [2 ]
Huang, Mingfeng [1 ]
Huang, Guosheng [3 ]
Ren, Yingying [1 ]
Liu, Anfeng [1 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[2] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Peoples R China
[3] Hunan First Normal Univ, Sch Comp Sci, Changsha, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Multi-agent; Trusted computing; Decision-making; Intelligent vehicle networks; MANAGEMENT SCHEME; EDGE; INTERNET;
D O I
10.1016/j.eswa.2023.119743
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The openness of the intelligent vehicle network makes it easy for selfish or untrustworthy vehicles to maliciously occupy limited resources in the mobile edge network or spread malicious information. However, most of the existing trust models rely on evaluating vehicles or data at the application level. For selfish or forgery attacks in intelligent vehicle networks, we propose a trusted deep reinforcement learning (DRL) cybersecurity approach for computation offloading to evaluate the safety and reliability performance in IoT edge networks, including our intelligent system model and a Deep Policy Gradient Action Quantization (DPGAQ) scheme. By introducing a reputation record table and designing a highly decisive communication trusted computing mode, we can accurately predict the untrusted selfish attack of vehicle in the task offloading of the Internet of things. Furthermore, in the multi-vehicle scenario, because the trusted offloading decision is a mixed integer programming problem, which leads to the dimension explosion of channel state and space, we propose a joint action-value quantization with attention mechanism to approximate the continuous actions values to a limited number of discrete values. Because it is not only inefficient but also unnecessary to generate high-dimensional decision actions in each time frame, we prune the infeasible action decisions by order preserving pruning to reduce the computational complexity of training and achieve efficient training on the premise of ensuring accuracy. To verify the feasibility and effectiveness of our proposed algorithm, millions of channels of edge vehicle networks are used as the input data. The simulation results show that compared with the benchmark trust model, DPGAQ achieves more than 72% reputation level, and improves 11%, 10% and 11% respectively in precision, recall and F-score.
引用
收藏
页数:13
相关论文
共 41 条
  • [1] NOTRINO: A NOvel Hybrid TRust Management Scheme for INternet-of-Vehicles
    Ahmad, Farhan
    Kurugollu, Fatih
    Kerrache, Chaker Abdelaziz
    Sezer, Sakir
    Liu, Lu
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (09) : 9244 - 9257
  • [2] MARINE: Man-in-the-Middle Attack Resistant Trust Model in Connected Vehicles
    Ahmad, Farhan
    Kurugollu, Fatih
    Adnane, Asma
    Hussain, Rasheed
    Hussain, Fatima
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (04) : 3310 - 3322
  • [3] Akella RT, 2021, AAAI CONF ARTIF INTE, V35, P6600
  • [4] Security Vulnerabilities of Connected Vehicle Streams and Their Impact on Cooperative Driving
    Amoozadeh, Mani
    Raghuramu, Arun
    Chuah, Chen-Nee
    Ghosal, Dipak
    Zhang, H. Michael
    Rowe, Jeff
    Levitt, Karl
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2015, 53 (06) : 126 - 132
  • [5] Babaeizadeh M, 2017, PhD
  • [6] Collaborative Service Placement for Edge Computing in Dense Small Cell Networks
    Chen, Lixing
    Shen, Cong
    Zhou, Pan
    Xu, Jie
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (02) : 377 - 390
  • [7] GPDS: A multi-agent deep reinforcement learning game for anti-jamming secure computing in MEC network
    Chen, Miaojiang
    Liu, Wei
    Zhang, Ning
    Li, Junling
    Ren, Yingying
    Yi, Meng
    Liu, Anfeng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 210
  • [8] A novel differential dynamic gradient descent optimization algorithm for resource allocation and offloading in the COMEC system
    Chen, Miaojiang
    Li, Zeyuan
    Chen, Peipei
    Liu, Wei
    Liu, Anfeng
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8365 - 8386
  • [9] RDRL: A Recurrent Deep Reinforcement Learning Scheme for Dynamic Spectrum Access in Reconfigurable Wireless Networks
    Chen, Miaojiang
    Liu, Anfeng
    Liu, Wei
    Ota, Kaoru
    Dong, Mianxiong
    Xiong, N. Neal
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (02): : 364 - 376
  • [10] A game-based deep reinforcement learning approach for energy-efficient computation in MEC systems
    Chen, Miaojiang
    Liu, Wei
    Wang, Tian
    Zhang, Shaobo
    Liu, Anfeng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 235