A Finite Horizon Markov Decision Process Based Reinforcement Learning Control of a Rapid Thermal Processing system

被引：8

作者：

Pradeep, D. John ^{[1
]}

Noel, Mathew Mithra ^{[1
]}

机构：

[1] VIT Univ, Sch Elect Engn, Vellore, Tamil Nadu, India

来源：

JOURNAL OF PROCESS CONTROL | 2018年 / 68卷

关键词：

Reinforcement Learning; Rapid Thermal Processing; Nonlinear control; Markov Decision Process; Process control; Multivariable control; TEMPERATURE CONTROL; WAFER TEMPERATURE;

D O I：

10.1016/j.jprocont.2018.06.002

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Manufacture of ultra large-scale integrated circuits involves accurate control of a challenging nonlinear Rapid Thermal Processing (RTP) system. Precise control of temperature profile and rapid ramp-up and ramp-down rates demanded by a RTP system cannot be achieved with conventional control strategies due to nonlinear and multi time-scale effects. In this paper the control of a RTP system is reformulated as an optimal multi-step sequential decision problem using the framework of finite horizon Markov decision processes and solved using a Reinforcement Learning (RL) algorithm. Three increasingly complex RL based control strategies are explored and compared with the existing state-of-the-art approach for controlling RTPs. Simulation results indicate that the approach proposed in this paper achieves superior control of the temperature profile and ramp-up and ramp-down rates for the RTP system. (C) 2018 Elsevier Ltd. All rights reserved.

引用

页码：218 / 225

页数：8

共 21 条

[1] Model-based control in rapid thermal processing [J].

Balakrishnan, KS ;

Edgar, TF .

THIN SOLID FILMS, 2000, 365 (02) :322-333

[2]

Bertsekas D. P., 1996, NEURO DYNAMIC PROGRA, DOI [10.1109/MCSE.1998.683749, DOI 10.1109/MCSE.1998.683749]

[3] Semi-empirical model-based multivariable iterative learning control of an RTP system [J].

Cho, M ;

Lee, Y ;

Joo, S ;

Lee, KS .

IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2005, 18 (03) :430-439

[4] A learning approach of wafer temperature control in a rapid thermal processing system [J].

Choi, JY ;

Do, HM .

IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2001, 14 (01) :1-10

[5] Modeling and temperature control of rapid thermal processing [J].

Dassau, E ;

Grosman, B ;

Lewin, DR .

COMPUTERS & CHEMICAL ENGINEERING, 2006, 30 (04) :686-697

[6]

Dietterich T.G., 1996, ACM Computing Surveys (CSUR), V28, P3, DOI [DOI 10.1145/242224.242229, 10.1145/242224.242229]

[7] Control strategies for thermal budget and temperature uniformity in spike rapid thermal processing systems [J].

Jeng, Jyh-Cheng ;

Chen, Wen-Chung .

COMPUTERS & CHEMICAL ENGINEERING, 2013, 57 :141-150

[8] Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data [J].

Lewis, F. L. ;

Vamvoudakis, Kyriakos G. .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (01) :14-25

[9]

Ng A., 2004, P INT S EXPT ROBOTIC, P1

[10] Control of a nonlinear liquid level system using a new artificial neural network based reinforcement learning approach [J].

Noel, Mattew Mithra ;

Pandian, B. Jaganatha .

APPLIED SOFT COMPUTING, 2014, 23 :444-451

← 1 2 3 →