Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges

被引:2
作者
Yahmed, Ahmed Haj [1 ]
Abbassi, Altaf Allah [1 ]
Nikanjam, Amin [1 ]
Li, Heng [1 ]
Khomh, Foutse [1 ]
机构
[1] Polytech Montreal, Montreal, PQ, Canada
来源
2023 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION, ICSME | 2023年
关键词
Empirical; Deep Reinforcement Learning; Software Deployment; Taxonomy of Challenges; Stack Overflow;
D O I
10.1109/ICSME58846.2023.00015
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep reinforcement learning (DRL), leveraging Deep Learning (DL) in reinforcement learning, has shown significant potential in achieving human-level autonomy in a wide range of domains, including robotics, computer vision, and computer games. This potential justifies the enthusiasm and growing interest in DRL in both academia and industry. However, the community currently focuses mostly on the development phase of DRL systems, with little attention devoted to DRL deployment. In this paper, we propose an empirical study on Stack Overflow (SO), the most popular Q&A forum for developers, to uncover and understand the challenges practitioners faced when deploying DRL systems. Specifically, we categorized relevant SO posts by deployment platforms: server/cloud, mobile/embedded system, browser, and game engine. After filtering and manual analysis, we examined 357 SO posts about DRL deployment, investigated the current state, and identified the challenges related to deploying DRL systems. Then, we investigate the prevalence and difficulty of these challenges. Results show that the general interest in DRL deployment is growing, confirming the study's relevance and importance. Results also show that DRL deployment is more difficult than other DRL issues. Additionally, we built a taxonomy of 31 unique challenges in deploying DRL to different platforms. On all platforms, RL environment-related challenges are the most popular, and communication-related challenges are the most difficult among practitioners. We hope our study inspires future research and helps the community overcome the most common and difficult challenges practitioners face when deploying DRL systems.
引用
收藏
页码:26 / 38
页数:13
相关论文
共 82 条
[1]  
16Aghnar, 2020, Reinforcement Learning coach: Saver fails to restore agent's checkpoint
[2]   What Do Developers Use the Crowd For? A Study Using Stack Overflow [J].
Abdalkareem, Rabe ;
Shihab, Emad ;
Rilling, Juergen .
IEEE SOFTWARE, 2017, 34 (02) :53-60
[3]   Challenges in Chatbot Development: A Study of Stack Overflow Posts [J].
Abdellatif, Ahmad ;
Costa, Diego ;
Badran, Khaled ;
Abdalkareem, Rabe ;
Shihab, Emad .
2020 IEEE/ACM 17TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2020, :174-185
[4]   What Do Concurrency Developers Ask About? A Large-scale Study Using Stack Overflow [J].
Ahmed, Syed ;
Bagherzadeh, Mehdi .
PROCEEDINGS OF THE 12TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON EMPIRICAL SOFTWARE ENGINEERING AND MEASUREMENT (ESEM 2018), 2018,
[5]  
ak, 2021, Reinforcement learning-custom environment implementation in java for python rl framework
[6]  
Alshangiti Moayad, 2019, INT SYMP EMP SOFTWAR, P117, DOI [DOI 10.1109/esem.2019.8870187, 10.1109/ESEM.2019.8870187]
[7]  
[Anonymous], 2006, P 23 INT C MACHINE L
[8]  
[Anonymous], 2022, Stable baselines3
[9]  
Apple Inc, 2017, Core ml
[10]  
archive.org, Stack exchange data dump