Dynamic scheduling for semiconductor manufacturing systems with uncertainties using convolutional neural networks and reinforcement learning

Citations: 0

Authors:

Juan Liu

Fei Qiao

Minjie Zou

Jonas Zinn

Yumin Ma

Birgit Vogel-Heuser

Affiliations:

[1] Tongji University, College of Electronic and Information Engineering

[2] Technical University of Munich, Institute of Automation and Information Systems

Source:

Complex & Intelligent Systems | 2022 / Vol. 8

Keywords:

Dynamic production scheduling; Uncertainties; Deep reinforcement learning (DRL); Convolutional neural networks (CNN); Rule-based

DOI:

Not available

Abstract:

The dynamic scheduling problem of semiconductor manufacturing systems (SMSs) is becoming more complicated and challenging due to internal uncertainties and external demand changes. To address this, this paper tackles the integrated release control and production scheduling problem with uncertain processing times and urgent orders, and proposes a method based on a convolutional neural network and the asynchronous advantage actor–critic algorithm (CNN-A3C) that involves a training phase and a deployment phase. In the training phase, actor–critic networks are trained to predict the evaluation of scheduling decisions and to output the optimal scheduling decision. In the deployment phase, the networks periodically generate the most appropriate release control and scheduling decisions according to the current production status. Furthermore, we improve four key elements of the deep reinforcement learning (DRL) algorithm (state space, action space, reward function, and network structure) by designing four mechanisms: a sliding-window-based two-dimensional state perception mechanism, an adaptive reward function that considers multiple objectives and automatically adjusts to dynamic events, a continuous action space based on composite dispatching rules (CDR) and release strategies, and actor–critic networks based on convolutional neural networks (CNNs). To verify the feasibility and effectiveness of the proposed dynamic scheduling method, it is implemented on a simplified SMS. The simulation results show that the proposed method outperforms the unimproved A3C-based method and common dispatching rules under new uncertain scenarios.
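
The idea of a continuous action space built on composite dispatching rules can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say which basic rules are combined or how, so the three rules used here (shortest processing time, earliest due date, first in first out), the min-max normalization, the `Lot` fields, and the function `composite_priority` are all assumptions made for illustration. The continuous action is modeled as a weight vector, and each lot's priority is the weighted sum of its normalized rule scores.

```python
from dataclasses import dataclass


@dataclass
class Lot:
    """Hypothetical lot record; the field names are illustrative assumptions."""
    name: str
    processing_time: float  # processing time on the current machine
    due_date: float
    arrival_time: float     # time the lot entered the queue


def _normalize(values):
    """Min-max scale values to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def composite_priority(lots, weights):
    """Order lots by a weighted sum of rule scores (lower score runs first).

    weights = (w_spt, w_edd, w_fifo) plays the role of the continuous action
    that an agent would output; this pairing is an assumption of the sketch.
    """
    w_spt, w_edd, w_fifo = weights
    spt = _normalize([lot.processing_time for lot in lots])
    edd = _normalize([lot.due_date for lot in lots])
    fifo = _normalize([lot.arrival_time for lot in lots])
    scores = [w_spt * s + w_edd * e + w_fifo * f
              for s, e, f in zip(spt, edd, fifo)]
    # sorted() is stable, so ties keep their original queue order.
    ranked = sorted(zip(scores, lots), key=lambda pair: pair[0])
    return [lot for _, lot in ranked]


lots = [
    Lot("A", processing_time=5.0, due_date=30.0, arrival_time=0.0),
    Lot("B", processing_time=2.0, due_date=50.0, arrival_time=1.0),
    Lot("C", processing_time=8.0, due_date=10.0, arrival_time=2.0),
]

# Putting all weight on one rule recovers that rule's ordering.
spt_order = [lot.name for lot in composite_priority(lots, (1.0, 0.0, 0.0))]
edd_order = [lot.name for lot in composite_priority(lots, (0.0, 1.0, 0.0))]
```

With all weight on a single rule the composite reduces to that rule, while intermediate weights blend the rules. That blending is what makes the action space continuous rather than a discrete choice among fixed rules.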

Pages: 4641-4662

Number of pages: 21