Adaptive constraint satisfaction for Markov decision process congestion games: Application to transportation networks

被引:1
作者
Li, Sarah H. Q. [1 ]
Yu, Yue [2 ]
Miguel, Nicolas I. [3 ]
Calderone, Dan [4 ]
Ratliff, Lillian J. [4 ]
Acikmese, Behcet [1 ]
机构
[1] Univ Washington, Dept Aeronaut & Astronaut, Seattle, WA 98195 USA
[2] Univ Texas Austin, Oden Inst Computat Engn & Sci, Austin, TX USA
[3] Purdue Univ, Dept Aeronaut & Astronaut, W Lafayette, IN USA
[4] Univ Washington, Dept Elect Engn, Seattle, WA USA
关键词
Markov decision process; Incentive design; Congestion games; Online optimization; Transportation systems; Stochastic games; NASH EQUILIBRIA;
D O I
10.1016/j.automatica.2023.110879
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Under the Markov decision process (MDP) congestion game framework, we study the problem of enforcing population distribution constraints on a population of players with stochastic dynamics and coupled congestion costs. Existing research demonstrates that the constraints on the players' population distribution can be satisfied by enforcing tolls. However, computing the minimum toll value for constraint satisfaction requires accurate modeling of the player's congestion costs. Motivated by settings where an accurate congestion cost model may be unavailable (e.g. transportation networks), we consider an MDP congestion game with unknown congestion costs. We assume that a constraintenforcing authority can repeatedly enforce tolls on a population of players that converges to an epsilon-optimal population distribution for any given toll. We then construct a myopic update algorithm to compute the minimum toll value while ensuring that the constraints are satisfied on average. We analyze how the players' sub-optimal responses to tolls impact the rates of convergence towards the minimum toll value and constraint satisfaction. Finally, we construct a congestion game model for Uber drivers in Manhattan, New York City (NYC) using data from the Taxi and Limousine Commission (TLC) to illustrate how to efficiently reduce congestion while minimizing the impact on driver earnings.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:8
相关论文
共 33 条
[1]  
Alvia, 2021, NEW YORK
[2]   Convex Optimization: Algorithms and Complexity [J].
不详 .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2015, 8 (3-4) :232-+
[3]   A CONTINUOUS MODEL OF TRANSPORTATION [J].
Beckmann, Martin .
ECONOMETRICA, 1952, 20 (04) :643-660
[4]  
Bertsekas D.P., 1999, NONLINEAR PROGRAMMIN
[5]  
Calcea Nicu, 2017, NYCDOTS EXPERIENCE B
[6]   Markov Decision Process Routing Games [J].
Calderone, Dan ;
Sastry, S. Shankar .
2017 ACM/IEEE 8TH INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS), 2017, :273-279
[7]  
City of New York, 2021, TLC TRIP RECORD DATA
[8]   Decentralized probabilistic density control of autonomous swarms with safety constraints [J].
Demir, Nazli ;
Eren, Utku ;
Acikmese, Behcet .
AUTONOMOUS ROBOTS, 2015, 39 (04) :537-554
[9]   First-order methods of smooth convex optimization with inexact oracle [J].
Devolder, Olivier ;
Glineur, Francois ;
Nesterov, Yurii .
MATHEMATICAL PROGRAMMING, 2014, 146 (1-2) :37-75
[10]  
Durkin Erin, 2021, NEW YORK BECOMES 1 C