Tunnel ventilation control via an actor-critic algorithm employing nonparametric policy gradients

Cited by: 8
Authors
Chu, Baeksuk [2]
Hong, Daehie [2]
Park, Jooyoung [1]
Affiliations
[1] Korea Univ, Dept Control & Instrumentat Engn, Chungnam 339700, South Korea
[2] Korea Univ, Div Mech Engn, Seoul 136701, South Korea
Keywords
Actor-critic architecture; Nonparametric methods; Policy search; Reinforcement learning (RL); Tunnel ventilation control; SYSTEM;
DOI
10.1007/s12206-008-0924-5
Chinese Library Classification (CLC) number
TH [Machinery and instrument industry];
Discipline classification code
0802;
Abstract
The appropriate operation of a tunnel ventilation system provides drivers passing through the tunnel with comfortable and safe driving conditions. Tunnel ventilation involves maintaining the CO pollutant concentration and the VI (visibility index) under an adequate level by operating highly energy-consuming facilities such as jet-fans. It is therefore important to have an operating algorithm that is efficient in terms of both a safe driving environment and energy savings. In this research, a reinforcement learning (RL) method based on the actor-critic architecture and nonparametric policy gradients is applied as the control algorithm. The two objectives listed above, maintaining an adequate level of pollutants and minimizing power consumption, are incorporated into a reward formulation, which is the performance index to be maximized in the RL methodology. In this paper, a nonparametric approach is adopted as a promising route to performing a rigorous gradient search in a function space of policies, in order to improve the efficacy of the actor module. Extensive simulation studies performed with real data collected from an existing tunnel system confirm that, with the suggested algorithm, the control objectives were well accomplished and that performance improved compared to a previously developed RL-based control algorithm.
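As a rough illustration of how two such objectives might be folded into one scalar reward, the following minimal Python sketch penalizes pollutant readings above their limits and adds a weighted power-consumption term. It is not the reward used in the paper: the function name, the argument names, the limits, the weighting, and the assumption that both CO and VI readings should stay below a limit (following the abstract's wording) are all hypothetical.

```python
def ventilation_reward(co, vi, fan_power, co_limit, vi_limit, energy_weight=0.5):
    """Hypothetical reward: higher is better, 0.0 is the best possible value.

    All names, limits, and the weighting are illustrative assumptions,
    not values taken from the paper.
    """
    # Penalize only the amount by which each reading exceeds its limit.
    co_excess = max(0.0, co - co_limit)
    vi_excess = max(0.0, vi - vi_limit)
    # Energy use is always penalized, scaled by an assumed trade-off weight.
    energy_penalty = energy_weight * fan_power
    return -(co_excess + vi_excess) - energy_penalty
```

An RL agent maximizing this quantity would be pushed to run the jet-fans only as hard as needed to keep the readings within their limits; the value of `energy_weight` sets the trade-off between the two objectives.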
Pages: 311-323
Number of pages: 13