A Finite Horizon Markov Decision Process Based Reinforcement Learning Control of a Rapid Thermal Processing system

被引:8
作者
Pradeep, D. John [1 ]
Noel, Mathew Mithra [1 ]
机构
[1] VIT Univ, Sch Elect Engn, Vellore, Tamil Nadu, India
关键词
Reinforcement Learning; Rapid Thermal Processing; Nonlinear control; Markov Decision Process; Process control; Multivariable control; TEMPERATURE CONTROL; WAFER TEMPERATURE;
D O I
10.1016/j.jprocont.2018.06.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Manufacture of ultra large-scale integrated circuits involves accurate control of a challenging nonlinear Rapid Thermal Processing (RTP) system. Precise control of temperature profile and rapid ramp-up and ramp-down rates demanded by a RTP system cannot be achieved with conventional control strategies due to nonlinear and multi time-scale effects. In this paper the control of a RTP system is reformulated as an optimal multi-step sequential decision problem using the framework of finite horizon Markov decision processes and solved using a Reinforcement Learning (RL) algorithm. Three increasingly complex RL based control strategies are explored and compared with the existing state-of-the-art approach for controlling RTPs. Simulation results indicate that the approach proposed in this paper achieves superior control of the temperature profile and ramp-up and ramp-down rates for the RTP system. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:218 / 225
页数:8
相关论文
共 21 条
[1]   Model-based control in rapid thermal processing [J].
Balakrishnan, KS ;
Edgar, TF .
THIN SOLID FILMS, 2000, 365 (02) :322-333
[2]  
Bertsekas D. P., 1996, NEURO DYNAMIC PROGRA, DOI [10.1109/MCSE.1998.683749, DOI 10.1109/MCSE.1998.683749]
[3]   Semi-empirical model-based multivariable iterative learning control of an RTP system [J].
Cho, M ;
Lee, Y ;
Joo, S ;
Lee, KS .
IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2005, 18 (03) :430-439
[4]   A learning approach of wafer temperature control in a rapid thermal processing system [J].
Choi, JY ;
Do, HM .
IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2001, 14 (01) :1-10
[5]   Modeling and temperature control of rapid thermal processing [J].
Dassau, E ;
Grosman, B ;
Lewin, DR .
COMPUTERS & CHEMICAL ENGINEERING, 2006, 30 (04) :686-697
[6]  
Dietterich T.G., 1996, ACM Computing Surveys (CSUR), V28, P3, DOI [DOI 10.1145/242224.242229, 10.1145/242224.242229]
[7]   Control strategies for thermal budget and temperature uniformity in spike rapid thermal processing systems [J].
Jeng, Jyh-Cheng ;
Chen, Wen-Chung .
COMPUTERS & CHEMICAL ENGINEERING, 2013, 57 :141-150
[8]   Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data [J].
Lewis, F. L. ;
Vamvoudakis, Kyriakos G. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (01) :14-25
[9]  
Ng A., 2004, P INT S EXPT ROBOTIC, P1
[10]   Control of a nonlinear liquid level system using a new artificial neural network based reinforcement learning approach [J].
Noel, Mattew Mithra ;
Pandian, B. Jaganatha .
APPLIED SOFT COMPUTING, 2014, 23 :444-451