Multiobjective Reinforcement Learning for Traffic Signal Control Using Vehicular Ad Hoc Network

Cited by: 30
Authors
Duan Houli [1]
Li Zhiheng [1]
Zhang Yi [1]
Affiliations
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
Keywords
FUZZY-LOGIC-CONTROLLER;
DOI
10.1155/2010/724035
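Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract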
We propose a new multiobjective control algorithm based on reinforcement learning for urban traffic signal control, named multi-RL. A multiagent structure is used to describe the traffic system, and a vehicular ad hoc network is used for data exchange among agents. A reinforcement learning algorithm is applied to predict the overall value of the optimization objective given the vehicles' states. The policy that minimizes the cumulative value of the optimization objective is regarded as the optimal one. To make the method adaptive to various traffic conditions, we also introduce a multiobjective control scheme in which the optimization objective is selected adaptively according to real-time traffic states. The optimization objectives include the number of vehicle stops, the average waiting time, and the maximum queue length at the next intersection. In addition, our model accommodates priority control for buses and emergency vehicles. The simulation results indicate that our algorithm can perform more efficiently than traditional traffic light control methods.
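The abstract's two core ideas, selecting the objective from the real-time traffic state and preferring the policy with the lowest cumulative objective value, can be illustrated with a minimal sketch. This is not the authors' multi-RL implementation: the occupancy thresholds, the mapping from traffic level to objective, the tabular cost-minimizing update, and every name below are assumptions made for illustration only.

```python
from collections import defaultdict

# Hypothetical thresholds on lane occupancy (fraction of capacity in use).
# The record above does not state how traffic levels are distinguished.
LIGHT_MAX = 0.3
MEDIUM_MAX = 0.7

# Hypothetical signal actions for a single intersection agent.
ACTIONS = ("green_ns", "green_ew")


def select_objective(occupancy):
    """Pick which objective to minimize (assumed mapping, not from the source)."""
    if occupancy < LIGHT_MAX:
        return "stops"          # light traffic: minimize vehicle stops
    if occupancy < MEDIUM_MAX:
        return "waiting_time"   # medium traffic: minimize average waiting time
    return "queue_length"       # heavy traffic: limit the downstream queue


def q_update(q, state, action, cost, next_state, alpha=0.1, gamma=0.9):
    """One tabular update of the expected cumulative cost (lower is better)."""
    best_next = min(q[(next_state, a)] for a in ACTIONS)
    q[(state, action)] += alpha * (cost + gamma * best_next - q[(state, action)])
    return q[(state, action)]


def greedy_action(q, state):
    """Choose the action with the lowest estimated cumulative cost."""
    return min(ACTIONS, key=lambda a: q[(state, a)])


if __name__ == "__main__":
    q = defaultdict(float)
    objective = select_objective(0.5)
    # The cost of 2.0 stands in for a measured value of the chosen objective.
    q_update(q, "s0", "green_ns", 2.0, "s1")
    print(objective, greedy_action(q, "s0"))
```

Because the objective is treated as a cost, both the update and the action choice use `min` rather than the `max` found in reward-based formulations.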
Pages: 7
Related papers
15 in total
  • [1] Reinforcement learning for True Adaptive traffic signal control
    Abdulhai, B
    Pringle, R
    Karakoulas, GJ
    [J]. JOURNAL OF TRANSPORTATION ENGINEERING, 2003, 129 (03) : 278 - 285
  • [2] [Anonymous], 1989, THESIS CAMBRIDGE U
  • [3] Queue spillovers in transportation networks with a route choice
    Daganzo, CF
    [J]. TRANSPORTATION SCIENCE, 1998, 32 (01) : 3 - 11
  • [4] Foy M. D., 1992, TRANSPORTATION RES R
  • [5] Reinforcement learning: A survey
    Kaelbling, LP
    Littman, ML
    Moore, AW
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 : 237 - 285
  • [6] Liu Z., 1997, Information and Control, V26, P441
  • [7] Mikami S., 1994, P 1 IEEE C EV COMP O, V1, P223
  • [8] FUZZY LOGIC-CONTROLLER FOR A TRAFFIC JUNCTION
    PAPPIS, CP
    MAMDANI, EH
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1977, 7 (10) : 707 - 717
  • [9] PARK B, 2000, TRANSPORTATION RES R
  • [10] Traffic-responsive signal timing for system-wide traffic control
    Spall, JC
    Chin, DC
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 1997, 5 (3-4) : 153 - 163