Reinforcement learning for real-time process control in high-temperature superconductor manufacturing

被引:3
作者
Feng, Qianmei [1 ,2 ]
Peng, Shenglin [1 ]
Lin, Ying [1 ,2 ]
Chen, Siwei [2 ,3 ,4 ,5 ]
Paidpilli, Mahesh [2 ,3 ,4 ]
Goel, Chirag [2 ,3 ,4 ]
Galstyan, Eduard [2 ,3 ,4 ]
Selvamanickam, Venkat [2 ,3 ,4 ]
机构
[1] Univ Houston, Dept Ind Engn, Houston, TX 77204 USA
[2] Univ Houston, Adv Mfg Inst, Houston, TX 77204 USA
[3] Univ Houston, Dept Mech Engn, Houston, TX 77204 USA
[4] Texas Ctr Superconduct, Houston, TX 77204 USA
[5] Princeton Plasma Phys Lab, Princeton, NJ 08540 USA
基金
美国国家科学基金会;
关键词
High-temperature superconductor; Reinforcement learning; Function approximation; Artificial neural network; Process control; CRITICAL-CURRENT DENSITY; YBA2CU3O7-DELTA FILMS; DEPOSITION; THICK;
D O I
10.1007/s00170-023-12369-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With high efficiency and low energy loss, high-temperature superconductors (HTS) have demonstrated their profound applications in various fields, such as medical imaging, transportation, accelerators, microwave devices, and power systems. The high-field applications of HTS tapes have raised the demand for producing cost-effective tapes with long lengths in superconductor manufacturing. However, achieving the uniform and enhanced performance of a long HTS tape is challenging due to the unstable growth conditions in the manufacturing process. Although it is confirmed that the process parameters during the advanced metal organic chemical vapor deposition (A-MOCVD) process influence the uniformity of the produced HTS tapes, the high-dimensional process parameter signals and their complicated interactions make it difficult to develop an effective control policy. In this paper, we propose a local measure for the uniformity of HTS tapes to provide instant feedback for our control policy. Then, we model the manufacturing of HTS tapes as a Markov decision process (MDP) with continuous state and action spaces to assess the instant reward in real time in our feedback control model. As our MDP involves continuous and high-dimensional state and action spaces, a neural fitted Q-iteration (NFQ) algorithm is adopted to solve the MDP with artificial neural network (ANN) function approximation. The collinearity of process parameters can restrict our capability of adjusting the process parameters, which is addressed by the principal component analysis (PCA) in our method. The control policy adjusts the PCA of process parameters using the NFQ algorithm. Based on our case studies on real A-MOCVD dataset, the obtained control policy increases the average uniformity of tapes by 5.6% and performs especially well on sample HTS tapes with a low uniformity.
引用
收藏
页码:2215 / 2225
页数:11
相关论文
共 38 条
[1]  
Ballarino A, 2000, APPL HIGH TEMPERATUR
[2]   POSSIBLE HIGH-TC SUPERCONDUCTIVITY IN THE BA-LA-CU-O SYSTEM [J].
BEDNORZ, JG ;
MULLER, KA .
ZEITSCHRIFT FUR PHYSIK B-CONDENSED MATTER, 1986, 64 (02) :189-193
[3]   The critical current of superconductors: an historical review [J].
Dew-Hughes, D .
LOW TEMPERATURE PHYSICS, 2001, 27 (9-10) :713-722
[4]  
Ernst D, 2005, J MACH LEARN RES, V6, P503
[5]   Application of high temperature superconductors for fusion [J].
Fietz, W. H. ;
Heller, R. ;
Schlachter, S. I. ;
Goldacker, W. .
FUSION ENGINEERING AND DESIGN, 2011, 86 (6-8) :1365-1368
[6]   Impact of critical current fluctuations on the performance of a coated conductor tape [J].
Gomory, Fedor ;
Souc, Jan ;
Adamek, Miroslav ;
Ghabeli, Asef ;
Solovyov, Mykola ;
Vojenciak, Michel .
SUPERCONDUCTOR SCIENCE & TECHNOLOGY, 2019, 32 (12)
[7]  
Higashikawa K., 2016, IEEE T APPL SUPERCON, V27, P1
[8]  
Hu X., 2016, IEEE T APPL SUPERCON, V27, P1, DOI DOI 10.1109/TASC.2016.2637330
[9]   Applications of high-temperature superconductors in power technology [J].
Hull, JR .
REPORTS ON PROGRESS IN PHYSICS, 2003, 66 (11) :1865-1886
[10]  
Kalsi S. S., 2011, APPL HIGH TEMPERATUR