Domain Adaptation in Reinforcement Learning: Approaches, Limitations, and Future Directions

被引:0
|
作者
Wang B. [1 ]
机构
[1] School Enterprise Cooperation and Employment Guidance Center, Zibo Vocational Institute, Shandong, Zibo
关键词
Domain adaptation; Machine learning; Reinforcement learning; Survey;
D O I
10.1007/s40031-024-01049-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) has demonstrated impressive results in various fields; however, its performance can be significantly hindered when the training and testing environments differ. Domain adaptation (DA) techniques aim to bridge this gap by moving knowledge between domains. This paper presents a thorough and systematic study of DA in RL. We review and categorize existing approaches for DA in RL, including model-free and model-based. We examine the drawbacks associated with each approach, such as sample inefficiency and generalization issues. Furthermore, we explore various strategies used in DA, such as feature adaptation, reward shaping, and data augmentation. We provide insights into the benefits and drawbacks of different techniques and propose future research directions for enhancing DA in RL. Through this study, the goal is to offer comprehensive insight into the current state of DA in RL and contribute to developing more robust and adaptable RL algorithms. © The Institution of Engineers (India) 2024.
引用
收藏
页码:1223 / 1240
页数:17
相关论文
共 50 条
  • [31] Reinforcement learning for dynamic multimedia adaptation
    Charvillat, Vincent
    Grigoras, Romulus
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (03) : 1034 - 1058
  • [32] A Survey of Domain-Specific Architectures for Reinforcement Learning
    Rothmann, Marc
    Porrmann, Mario
    IEEE ACCESS, 2022, 10 : 13753 - 13767
  • [33] The Role of Machine Learning in Game Development Domain - A Review of Current Trends and Future Directions
    Edwards, Gemma
    Subianto, Nicholas
    Englund, David
    Goh, Jun Wei
    Coughran, Nathan
    Milton, Zachary
    Mirnateghi, Nima
    Shah, Syed Afaq Ali
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 495 - 501
  • [34] Fine-tuning Deep Reinforcement Learning Policies with r-STDP for Domain Adaptation
    Akl, Mahmoud
    Sandamirskaya, Yulia
    Ergene, Deniz
    Walter, Florian
    Knoll, Alois
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NEUROMORPHIC SYSTEMS 2022, ICONS 2022, 2022,
  • [35] A Survey on Cross-domain Recommendation: Taxonomies, Methods, and Future Directions
    Zang, Tianzi
    Zhu, Yanmin
    Liu, Haobing
    Zhang, Ruohan
    Yu, Jiadi
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (02)
  • [36] Dynamics-Aware Adaptation for Reinforcement Learning Based Cross-Domain Interactive Recommendation
    Wu, Junda
    Xie, Zhihui
    Yu, Tong
    Zhao, Handong
    Zhang, Ruiyi
    Li, Shuai
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 290 - 300
  • [37] Artificial Intelligence in pathology: current applications, limitations, and future directions
    Sajithkumar, Akhil
    Thomas, Jubin
    Saji, Ajish Meprathumalil
    Ali, Fousiya
    Hasin, E. K. Haneena
    Adampulan, Hannan Abdul Gafoor
    Sarathchand, Swathy
    IRISH JOURNAL OF MEDICAL SCIENCE, 2024, 193 (02) : 1117 - 1121
  • [38] Artificial Intelligence in pathology: current applications, limitations, and future directions
    Akhil Sajithkumar
    Jubin Thomas
    Ajish Meprathumalil Saji
    Fousiya Ali
    Haneena Hasin E.K
    Hannan Abdul Gafoor Adampulan
    Swathy Sarathchand
    Irish Journal of Medical Science (1971 -), 2024, 193 : 1117 - 1121
  • [39] Machine learning approaches to improve disease management of patients with rheumatoid arthritis: review and future directions
    Kedra, Joanna
    Davergne, Thomas
    Braithwaite, Ben
    Servy, Herve
    Gossec, Laure
    EXPERT REVIEW OF CLINICAL IMMUNOLOGY, 2021, 17 (12) : 1311 - 1321
  • [40] The Frontiers of Deep Reinforcement Learning for Resource Management in Future Wireless HetNets: Techniques, Challenges, and Research Directions
    Alwarafy, Abdulmalik
    Abdallah, Mohamed
    Ciftler, Bekir Sait
    Al-Fuqaha, Ala
    Hamdi, Mounir
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2022, 3 : 322 - 365