Reinforcement Learning With Composite Rewards for Production Scheduling in a Smart Factory

被引：35

作者：

Zhou, Tong ^{[1
]}

Tang, Dunbing ^{[1
]}

Zhu, Haihua ^{[1
]}

Wang, Liping ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Mech & Elect Engn, Nanjing 210016, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷

基金：

中国国家自然科学基金;

关键词：

Production scheduling; reinforcement learning; composite reward; smart factory; neural network;

D O I：

10.1109/ACCESS.2020.3046784

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Rapid advances of sensing and cloud technologies transform the manufacturing system into a data-rich environment and make production scheduling increasingly complex. Traditional offline scheduling methods are limited in the ability to handle low-volume-high-mix workorders with diverse design specifications. Simulation-based methods show the promise for distributed scheduling of manufacturing jobs but are mostly implemented with historical data and empirical rules in a static manner. Recently, artificial intelligence (AI) algorithms fuel increasing interests to solve dynamic scheduling problems in the manufacturing setting. However, it's difficult to utilize high-dimensional data for production scheduling while considering multiple practical objectives for smart manufacturing (e.g., minimize the makespan, reduce production costs, balance workloads). Therefore, this paper presents a new AI scheduler with composite reward functions for data-driven dynamic scheduling of manufacturing jobs under uncertainty in a smart factory. Internet-enabled sensor networks are deployed in the smart factory to track real-time statuses of workorders, machines, and material handling systems. A novel manufacturing value network is developed to take high-dimensional data as the input and then learn the state-action values for real-time decision making. Based on reinforcement learning (RL), composite rewards help the AI scheduler learn efficiently to achieve multiple objectives for production scheduling in real time. The proposed methodology is evaluated and validated with experimental studies in a smart manufacturing setting. Experimental results show that the new AI scheduler not only improves the multi-objective performance metrics in the production scheduling problem but also effectively copes with unexpected events (e.g., urgent workorders, machine failures) in manufacturing systems.

引用

页码：752 / 766

页数：15

共 50 条

[1] Real-time scheduling for a smart factory using a reinforcement learning approach
Shiue, Yeou-Ren
Lee, Ken-Chuan
Su, Chao-Ton
COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 125 : 604 - 614
[2] Reinforcement learning for online optimization of job-shop scheduling in a smart manufacturing factory
Zhou, Tong
Zhu, Haihua
Tang, Dunbing
Liu, Changchun
Cai, Qixiang
Shi, Wei
Gui, Yong
ADVANCES IN MECHANICAL ENGINEERING, 2022, 14 (03)
[3] Smart Master Production Scheduling by Deep Reinforcement Learning: An Exploratory Analysis
Serrano-Ruiz, Julio C.
Mula, Josefa
Poler, Raul
Diaz-Madronero, Manuel
NAVIGATING UNPREDICTABILITY: COLLABORATIVE NETWORKS IN NON-LINEAR WORLDS, PRO-VE 2024, PT II, 2024, 727 : 228 - 244
[4] A Deep Reinforcement Learning Approach for Smart Coordination Between Production Planning and Scheduling
Gomez-Gasquet, Pedro
Boza, Andres
Perez Perales, David
Esteso, Ana
ENTERPRISE INTEROPERABILITY X, EI 2022, 2024, 11 : 195 - 206
[5] Digital Twin and Reinforcement Learning-Based Resilient Production Control for Micro Smart Factory
Park, Kyu Tae
Son, Yoo Ho
Ko, Sang Wook
Noh, Sang Do
APPLIED SCIENCES-BASEL, 2021, 11 (07):
[6] Smart Scheduling of Electric Vehicles Based on Reinforcement Learning
Viziteu, Andrei
Furtuna, Daniel
Robu, Andrei
Senocico, Stelian
Cioata, Petru
Remus Baltariu, Marian
Filote, Constantin
Raboaca, Maria Simona
SENSORS, 2022, 22 (10)
[7] Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces
Huang, Zhigang
Liu, Quan
Zhu, Fei
Zhang, Lihua
Wu, Lan
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
[8] Deep Reinforcement Learning for Semiconductor Production Scheduling
Waschneck, Bernd
Reichstaller, Andre
Belzner, Lenz
Altenmueller, Thomas
Bauernhansl, Thomas
Knapp, Alexander
Kyek, Andreas
2018 29TH ANNUAL SEMI ADVANCED SEMICONDUCTOR MANUFACTURING CONFERENCE (ASMC), 2018, : 301 - 306
[9] Reinforcement Learning with Perturbed Rewards
Wang, Jingkang
Liu, Yang
Li, Bo
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6202 - 6209
[10] Smart Security Guard Scheduling System Based On the Reinforcement Learning
Jiang, Cheng
Kusakunniran, Worapan
Pornprasatpol, Natchanon
Limsuwankesorn, Chanapai
Li, Yi
2017 21ST INTERNATIONAL COMPUTER SCIENCE AND ENGINEERING CONFERENCE (ICSEC 2017), 2017, : 214 - 218

← 1 2 3 4 5 →