Reinforcement Learning With Composite Rewards for Production Scheduling in a Smart Factory

被引:35
|
作者
Zhou, Tong [1 ]
Tang, Dunbing [1 ]
Zhu, Haihua [1 ]
Wang, Liping [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Mech & Elect Engn, Nanjing 210016, Peoples R China
基金
中国国家自然科学基金;
关键词
Production scheduling; reinforcement learning; composite reward; smart factory; neural network;
D O I
10.1109/ACCESS.2020.3046784
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Rapid advances of sensing and cloud technologies transform the manufacturing system into a data-rich environment and make production scheduling increasingly complex. Traditional offline scheduling methods are limited in the ability to handle low-volume-high-mix workorders with diverse design specifications. Simulation-based methods show the promise for distributed scheduling of manufacturing jobs but are mostly implemented with historical data and empirical rules in a static manner. Recently, artificial intelligence (AI) algorithms fuel increasing interests to solve dynamic scheduling problems in the manufacturing setting. However, it's difficult to utilize high-dimensional data for production scheduling while considering multiple practical objectives for smart manufacturing (e.g., minimize the makespan, reduce production costs, balance workloads). Therefore, this paper presents a new AI scheduler with composite reward functions for data-driven dynamic scheduling of manufacturing jobs under uncertainty in a smart factory. Internet-enabled sensor networks are deployed in the smart factory to track real-time statuses of workorders, machines, and material handling systems. A novel manufacturing value network is developed to take high-dimensional data as the input and then learn the state-action values for real-time decision making. Based on reinforcement learning (RL), composite rewards help the AI scheduler learn efficiently to achieve multiple objectives for production scheduling in real time. The proposed methodology is evaluated and validated with experimental studies in a smart manufacturing setting. Experimental results show that the new AI scheduler not only improves the multi-objective performance metrics in the production scheduling problem but also effectively copes with unexpected events (e.g., urgent workorders, machine failures) in manufacturing systems.
引用
收藏
页码:752 / 766
页数:15
相关论文
共 50 条
  • [1] Real-time scheduling for a smart factory using a reinforcement learning approach
    Shiue, Yeou-Ren
    Lee, Ken-Chuan
    Su, Chao-Ton
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 125 : 604 - 614
  • [2] Reinforcement learning for online optimization of job-shop scheduling in a smart manufacturing factory
    Zhou, Tong
    Zhu, Haihua
    Tang, Dunbing
    Liu, Changchun
    Cai, Qixiang
    Shi, Wei
    Gui, Yong
    ADVANCES IN MECHANICAL ENGINEERING, 2022, 14 (03)
  • [3] Smart Master Production Scheduling by Deep Reinforcement Learning: An Exploratory Analysis
    Serrano-Ruiz, Julio C.
    Mula, Josefa
    Poler, Raul
    Diaz-Madronero, Manuel
    NAVIGATING UNPREDICTABILITY: COLLABORATIVE NETWORKS IN NON-LINEAR WORLDS, PRO-VE 2024, PT II, 2024, 727 : 228 - 244
  • [4] A Deep Reinforcement Learning Approach for Smart Coordination Between Production Planning and Scheduling
    Gomez-Gasquet, Pedro
    Boza, Andres
    Perez Perales, David
    Esteso, Ana
    ENTERPRISE INTEROPERABILITY X, EI 2022, 2024, 11 : 195 - 206
  • [5] Digital Twin and Reinforcement Learning-Based Resilient Production Control for Micro Smart Factory
    Park, Kyu Tae
    Son, Yoo Ho
    Ko, Sang Wook
    Noh, Sang Do
    APPLIED SCIENCES-BASEL, 2021, 11 (07):
  • [6] Smart Scheduling of Electric Vehicles Based on Reinforcement Learning
    Viziteu, Andrei
    Furtuna, Daniel
    Robu, Andrei
    Senocico, Stelian
    Cioata, Petru
    Remus Baltariu, Marian
    Filote, Constantin
    Raboaca, Maria Simona
    SENSORS, 2022, 22 (10)
  • [7] Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces
    Huang, Zhigang
    Liu, Quan
    Zhu, Fei
    Zhang, Lihua
    Wu, Lan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [8] Deep Reinforcement Learning for Semiconductor Production Scheduling
    Waschneck, Bernd
    Reichstaller, Andre
    Belzner, Lenz
    Altenmueller, Thomas
    Bauernhansl, Thomas
    Knapp, Alexander
    Kyek, Andreas
    2018 29TH ANNUAL SEMI ADVANCED SEMICONDUCTOR MANUFACTURING CONFERENCE (ASMC), 2018, : 301 - 306
  • [9] Reinforcement Learning with Perturbed Rewards
    Wang, Jingkang
    Liu, Yang
    Li, Bo
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6202 - 6209
  • [10] Smart Security Guard Scheduling System Based On the Reinforcement Learning
    Jiang, Cheng
    Kusakunniran, Worapan
    Pornprasatpol, Natchanon
    Limsuwankesorn, Chanapai
    Li, Yi
    2017 21ST INTERNATIONAL COMPUTER SCIENCE AND ENGINEERING CONFERENCE (ICSEC 2017), 2017, : 214 - 218