Ramp Metering Based on On-line ADHDP (λ) Controller

被引:5
作者
Bai, Xuerui [1 ]
Zhao, Dongbin [1 ]
Yi, Jianqiang [1 ]
Xu, Jing [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Lab Complex Syst & Intelligence Sci, Beijing 100080, Peoples R China
来源
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8 | 2008年
关键词
D O I
10.1109/IJCNN.2008.4634049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Increasing dependence on car-based travel has led to the daily occurrence of freeway congestions around the world. In order to improve the worse and worse traffic congestion situation and solve the problems brought with it, a new kind of effective, fast, and robust method should be presented. Ramp metering has been developed as a traffic management strategy to alleviate congestion on freeways. But, it doesn't work well in uncertainty situations. In this paper, in order to solve the problems in uncertainty conditions, an on-tine learning control method based on the fundamental principle of reinforcement learning is proposed. The method is ADP (adaptive dynamic programming) and in order to expedite the learning rate, the concept about eligibility traces is introduced here. Then eligibility trace and ADP is combined to present a new kind of traffic responsive control method. The new method is called action-dependent heuristic dynamic programming based on eligibility traces (ADRDP (lambda)). ADHDP (lambda) is an approximate optimal ramp metering method. Simulation studies on a hypothetical freeway indicate good control performance of the proposed real-time traffic controller.
引用
收藏
页码:1847 / 1852
页数:6
相关论文
共 17 条
[1]  
GOLSTEIN NB, 1982, TRANSPN RES B, V16, P279
[2]   Reinforcement learning: A survey [J].
Kaelbling, LP ;
Littman, ML ;
Moore, AW .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285
[3]  
LENDARIS GG, 1997, P 1997 IEEE INT C NE, V6, P712
[4]   ON KINEMATIC WAVES .2. A THEORY OF TRAFFIC FLOW ON LONG CROWDED ROADS [J].
LIGHTHILL, MJ ;
WHITHAM, GB .
PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL AND PHYSICAL SCIENCES, 1955, 229 (1178) :317-345
[5]  
LIU DR, 2001, P INT JOINT C NEUR N, V2, P15
[6]  
Nsour S. A., 1992, TRANSPORT RES REC, V1365, P116
[7]   Freeway ramp metering: An overview [J].
Papageorgiou, M ;
Kotsialos, A .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2002, 3 (04) :271-281
[8]  
Papageorgiou M., 1991, Transportation Research Record, V1320, P58
[9]   Adaptive critic designs [J].
Prokhorov, DV ;
Wunsch, DC .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (05) :997-1007
[10]  
ROBINSON J, RAMP METERING STATUS