Application of Reinforcement Learning in Decision Systems: Lift Control Case Study

被引：2

作者：

Wojtulewicz, Mateusz ^{[1
]}

Szmuc, Tomasz ^{[2
]}

机构：

[1] AGH Univ Krakow, Ctr Excellence Artificial Intelligence, PL-30059 Krakow, Poland

[2] AGH Univ Krakow, Fac Elect Engn, Dept Appl Comp Sci, Automat, PL-30059 Krakow, Poland

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 02期

关键词：

decision systems; artificial intelligence; reinforcement learning; lift control; ELEVATOR GROUP CONTROL;

D O I：

10.3390/app14020569

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

This study explores the application of reinforcement learning (RL) algorithms to optimize lift control strategies. By developing a versatile lift simulator enriched with real-world traffic data from an intelligent building system, we systematically compare RL-based strategies against well-established heuristic solutions. The research evaluates their performance using predefined metrics to improve our understanding of RL's effectiveness in solving complex decision problems, such as the lift control algorithm. The results of the experiments show that all trained agents developed strategies that outperform the heuristic algorithms in every metric. Furthermore, the study conducts a comprehensive exploration of three Experience Replay mechanisms, aiming to enhance the performance of the chosen RL algorithm, Deep Q-Learning.

引用

页数：12

共 25 条

[1]

Abadi M., 2015, TENSORFLOW LARGE SCA

[2] Elevator group control using multiple reinforcement learning agents [J].

Crites, RH ;

Barto, AG .

MACHINE LEARNING, 1998, 33 (2-3) :235-262

[3]

Crites RobertH., 1996, Advances in Neural Information Processing Systems

[4] Array programming with NumPy [J].

Harris, Charles R. ;

Millman, K. Jarrod ;

van der Walt, Stefan J. ;

Gommers, Ralf ;

Virtanen, Pauli ;

Cournapeau, David ;

Wieser, Eric ;

Taylor, Julian ;

Berg, Sebastian ;

Smith, Nathaniel J. ;

Kern, Robert ;

Picus, Matti ;

Hoyer, Stephan ;

van Kerkwijk, Marten H. ;

Brett, Matthew ;

Haldane, Allan ;

del Rio, Jaime Fernandez ;

Wiebe, Mark ;

Peterson, Pearu ;

Gerard-Marchant, Pierre ;

Sheppard, Kevin ;

Reddy, Tyler ;

Weckesser, Warren ;

Abbasi, Hameer ;

Gohlke, Christoph ;

Oliphant, Travis E. .

NATURE, 2020, 585 (7825) :357-362

[5]

Horgan D, 2018, Arxiv, DOI arXiv:1803.00933

[6]

IBM, 2010, The Smarter Buildings Survey

[7]

IMASAKI N, 1995, PROCEEDINGS OF 1995 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS I-IV, P1735, DOI 10.1109/FUZZY.1995.409916

[8]

Li H, 2015, IEEE INT C EMERG

[9]

Li S.E., 2023, Reinforcement Learning for Sequential Decision and Optimal Control, P365

[10]

Liang C.J.M., 2013, P 5 ACM WORKSH EMB S, P1, DOI [10.1145/2528282.2528314, DOI 10.1145/2528282.2528314]

← 1 2 3 →