Learning to Entangle Radio Resources in Vehicular Communications: An Oblivious Game-Theoretic Perspective

被引:12
作者
Chen, Xianfu [1 ]
Wu, Celimuge [2 ]
Bennis, Mehdi [3 ]
Zhao, Zhifeng [4 ]
Han, Zhu [5 ,6 ]
机构
[1] VTT Tech Res Ctr Finland, Oulu 1000, Finland
[2] Univ Electrocommun, Grad Sch Informat & Engn, Tokyo 1828585, Japan
[3] Univ Oulu, Ctr Wireless Commun, Oulu 90014, Finland
[4] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[5] Univ Houston, Houston, TX 77004 USA
[6] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul 02447, South Korea
基金
芬兰科学院; 美国国家科学基金会;
关键词
Vehicle-to-vehicle communications; Markov decision process; stochastic games; Markov perfect equilibrium; oblivious equilibrium; reinforcement learning; NETWORK; INFORMATION;
D O I
10.1109/TVT.2019.2907589
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we investigate non-cooperative radio resource management in a vehicle-to-vehicle communication network. The technical challenges lie in high-vehicle mobility and data traffic variations. Over the discrete scheduling slots, each vehicle user equipment (VUE)-pair competes with other VUE-pairs in the coverage of a road side unit (RSU) for the limited frequency to transmit queued data packets, aiming to optimize the expected long-term performance. The frequency allocation at the beginning of each slot at the RSU is regulated by a sealed second-price auction. Such interactions among VUE-pairs are modeled as a stochastic game with a semi-continuous global network state space. By defining a partitioned control policy, we transform the original game into an equivalent stochastic game with a global queue state space of finite size. We adopt an oblivious equilibrium (OE) to approximate the Markov perfect equilibrium, which characterizes the optimal solution to the equivalent game. The OE solution is theoretically proven to be with an asymptotic Markov equilibrium property. Due to the lack of a priori knowledge of network dynamics, we derive an online algorithm to learn the OE solution. Numerical simulations validate the theoretical analysis and show the effectiveness of the proposed online learning algorithm.
引用
收藏
页码:4262 / 4274
页数:13
相关论文
共 39 条
  • [1] On Oblivious Equilibrium in Large Population Stochastic Games
    Adlakha, Sachin
    Johari, Ramesh
    Weintraub, Gabriel Y.
    Goldsmith, Andrea
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 3117 - 3124
  • [2] Information-Centric Networking for Connected Vehicles: A Survey and Future Perspectives
    Amadeo, Marica
    Campolo, Claudia
    Molinaro, Antonella
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2016, 54 (02) : 98 - 104
  • [3] [Anonymous], 2018, Abstract Dynamic Programming
  • [4] [Anonymous], 2018, ARXIV180708127
  • [5] [Anonymous], 2018, ARXIV180709350
  • [6] Low Complexity Outage Optimal Distributed Channel Allocation for Vehicle-to-Vehicle Communications
    Bai, Bo
    Chen, Wei
    Ben Letaief, Khaled
    Cao, Zhigang
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2011, 29 (01) : 161 - 172
  • [7] Bai F, 2003, IEEE INFOCOM SER, P825
  • [8] 5G NETWORK SLICING FOR VEHICLE-TO-EVERYTHING SERVICES
    Campolo, Claudia
    Molinaro, Antonella
    Iera, Antonio
    Menichella, Francesco
    [J]. IEEE WIRELESS COMMUNICATIONS, 2017, 24 (06) : 38 - 45
  • [9] Chen X., 2018, P IEEE VEH TECHN C C
  • [10] Optimized Computation Offloading Performance in Virtual Edge Computing Systems via Deep Reinforcement Learning
    Chen, Xianfu
    Zhang, Honggang
    Wu, Celimuge
    Mao, Shiwen
    Ji, Yusheng
    Bennis, Mehdi
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (03): : 4005 - 4018