Domain Adaptation in Reinforcement Learning: Approaches, Limitations, and Future Directions

被引：0

作者：

Wang B. ^{[1
]}

机构：

[1] School Enterprise Cooperation and Employment Guidance Center, Zibo Vocational Institute, Shandong, Zibo

来源：

Journal of The Institution of Engineers (India): Series B | 2024年 / 105卷 / 05期

关键词：

Domain adaptation; Machine learning; Reinforcement learning; Survey;

D O I：

10.1007/s40031-024-01049-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning (RL) has demonstrated impressive results in various fields; however, its performance can be significantly hindered when the training and testing environments differ. Domain adaptation (DA) techniques aim to bridge this gap by moving knowledge between domains. This paper presents a thorough and systematic study of DA in RL. We review and categorize existing approaches for DA in RL, including model-free and model-based. We examine the drawbacks associated with each approach, such as sample inefficiency and generalization issues. Furthermore, we explore various strategies used in DA, such as feature adaptation, reward shaping, and data augmentation. We provide insights into the benefits and drawbacks of different techniques and propose future research directions for enhancing DA in RL. Through this study, the goal is to offer comprehensive insight into the current state of DA in RL and contribute to developing more robust and adaptable RL algorithms. © The Institution of Engineers (India) 2024.

引用

页码：1223 / 1240

页数：17

共 50 条

[31] Reinforcement learning for dynamic multimedia adaptation
Charvillat, Vincent
Grigoras, Romulus
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (03) : 1034 - 1058
[32] A Survey of Domain-Specific Architectures for Reinforcement Learning
Rothmann, Marc
Porrmann, Mario
IEEE ACCESS, 2022, 10 : 13753 - 13767
[33] The Role of Machine Learning in Game Development Domain - A Review of Current Trends and Future Directions
Edwards, Gemma
Subianto, Nicholas
Englund, David
Goh, Jun Wei
Coughran, Nathan
Milton, Zachary
Mirnateghi, Nima
Shah, Syed Afaq Ali
2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 495 - 501
[34] Fine-tuning Deep Reinforcement Learning Policies with r-STDP for Domain Adaptation
Akl, Mahmoud
Sandamirskaya, Yulia
Ergene, Deniz
Walter, Florian
Knoll, Alois
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NEUROMORPHIC SYSTEMS 2022, ICONS 2022, 2022,
[35] A Survey on Cross-domain Recommendation: Taxonomies, Methods, and Future Directions
Zang, Tianzi
Zhu, Yanmin
Liu, Haobing
Zhang, Ruohan
Yu, Jiadi
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (02)
[36] Dynamics-Aware Adaptation for Reinforcement Learning Based Cross-Domain Interactive Recommendation
Wu, Junda
Xie, Zhihui
Yu, Tong
Zhao, Handong
Zhang, Ruiyi
Li, Shuai
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 290 - 300
[37] Artificial Intelligence in pathology: current applications, limitations, and future directions
Sajithkumar, Akhil
Thomas, Jubin
Saji, Ajish Meprathumalil
Ali, Fousiya
Hasin, E. K. Haneena
Adampulan, Hannan Abdul Gafoor
Sarathchand, Swathy
IRISH JOURNAL OF MEDICAL SCIENCE, 2024, 193 (02) : 1117 - 1121
[38] Artificial Intelligence in pathology: current applications, limitations, and future directions
Akhil Sajithkumar
Jubin Thomas
Ajish Meprathumalil Saji
Fousiya Ali
Haneena Hasin E.K
Hannan Abdul Gafoor Adampulan
Swathy Sarathchand
Irish Journal of Medical Science (1971 -), 2024, 193 : 1117 - 1121
[39] Machine learning approaches to improve disease management of patients with rheumatoid arthritis: review and future directions
Kedra, Joanna
Davergne, Thomas
Braithwaite, Ben
Servy, Herve
Gossec, Laure
EXPERT REVIEW OF CLINICAL IMMUNOLOGY, 2021, 17 (12) : 1311 - 1321
[40] The Frontiers of Deep Reinforcement Learning for Resource Management in Future Wireless HetNets: Techniques, Challenges, and Research Directions
Alwarafy, Abdulmalik
Abdallah, Mohamed
Ciftler, Bekir Sait
Al-Fuqaha, Ala
Hamdi, Mounir
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2022, 3 : 322 - 365

← 1 2 3 4 5 →