A model-based deep reinforcement learning approach to the nonblocking coordination of modular supervisors of discrete event systems

被引:9
作者
Yang, Junjun [1 ]
Tan, Kaige [2 ]
Feng, Lei [2 ]
Li, Zhiwu [1 ,3 ]
机构
[1] Xidian Univ, Sch Electromech Engn, Xian 710071, Peoples R China
[2] KTH Royal Inst Technol, Dept Machine Design, S-10044 Stockholm, Sweden
[3] Macau Univ Sci & Technol, Inst Syst Engn, Taipa, Macau, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Deep reinforcement learning; Discrete event system; Local modular control; Supervisory control theory; COMPLEXITY; DESIGN;
D O I
10.1016/j.ins.2023.02.033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modular supervisory control may lead to conflicts among the modular supervisors for large-scale discrete event systems. The existing methods for ensuring nonblocking control of modular supervisors either exploit favorable structures in the system model to guarantee the nonblocking property of modular supervisors or employ hierarchical model abstraction methods for reducing the computational complexity of designing a nonblocking coordinator. The nonblocking modular control problem is, in general, NP-hard. This study integrates supervisory control theory and a model-based deep reinforcement learning method to synthesize a nonblocking coordinator for the modular supervisors. The deep reinforcement learning method significantly reduces the computational complexity by avoiding the computation of synchronization of multiple modular supervisors and the plant models. The supervisory control function is approximated by the deep neural network instead of a large-sized finite automaton. Furthermore, the proposed model-based deep reinforcement learning method is more efficient than the standard deep Q network algorithm.
引用
收藏
页码:305 / 321
页数:17
相关论文
共 50 条
[41]   Long-Term Recommendation Model for Online Education Systems: A Deep Reinforcement Learning Approach [J].
Wang, Wei .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (02) :342-350
[42]   An End-to-End Deep Reinforcement Learning Based Modular Task Allocation Framework for Autonomous Mobile Systems [J].
Ma, Song ;
Ruan, Jingqing ;
Du, Yali ;
Bucknall, Richard ;
Liu, Yuanchang .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 :1519-1533
[43]   An End-to-End Deep Reinforcement Learning Based Modular Task Allocation Framework for Autonomous Mobile Systems [J].
Ma, Song ;
Ruan, Jingqing ;
Du, Yali ;
Bucknall, Richard ;
Liu, Yuanchang .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 :1519-1533
[44]   A model-based approach to associate complexity and robustness in engineering systems [J].
Gohler, Simon Moritz ;
Frey, Daniel D. ;
Howard, Thomas J. .
RESEARCH IN ENGINEERING DESIGN, 2017, 28 (02) :223-234
[45]   A deep reinforcement learning based hyper-heuristic for modular production control [J].
Panzer, Marcel ;
Bender, Benedict ;
Gronau, Norbert .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024, 62 (08) :2747-2768
[46]   Deep reinforcement learning-based model predictive control of uncertain linear systems [J].
Hu, Pengcheng ;
Cao, Xinyuan ;
Zhang, Kunwu ;
Shi, Yang .
2024 IEEE 7TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS 2024, 2024,
[47]   State Predictive Control of Modular SMES Magnet Based on Deep Reinforcement Learning [J].
Zhang, Zitong ;
Shi, Jing ;
Guo, Shuqiang ;
Yang, Wangwang ;
Lin, Dengquan ;
Xu, Ying ;
Ren, Li .
IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY, 2022, 32 (06)
[48]   Deep and Shallow Learning Model-Based Sleep Apnea Diagnosis Systems: A Comprehensive Study [J].
Raisa, Roksana Akter ;
Rodela, Ayesha Siddika ;
Abu Yousuf, Mohammad ;
Azad, Akm ;
Alyami, Salem A. ;
Lio, Pietro ;
Islam, Md Zahidul ;
Pogrebna, Ganna ;
Moni, Mohammad Ali .
IEEE ACCESS, 2024, 12 :122959-122987
[49]   A Deep Reinforcement Learning Based Approach for Intelligent Reconfigurable Surface Elements Selection [J].
Zan, Shiming ;
Pang, Yu ;
Gravina, Raffaele ;
Cao, Enling ;
Li, Ye ;
Zang, Weilin .
2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, :56-62
[50]   Directed control of discrete event systems: Optimization based approach [J].
Huang, Jing ;
Kumar, Ratnesh .
2006 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2006, 1-12 :4141-4146