Transfer Learning for Dynamical Systems Models via Autoencoders and GANs

被引:0
作者
Damiani, Angelo [1 ]
Lopez, Gustavo Viera [1 ]
Manganini, Giorgio [2 ]
Metelli, Alberto Maria [3 ]
Restelli, Marcello [3 ]
机构
[1] Gran Sasso Sci Inst, Laquila, Italy
[2] Fraunhofer Res Italia, Bolzano, Italy
[3] Politecn Milan, Milan, Italy
来源
2024 AMERICAN CONTROL CONFERENCE, ACC 2024 | 2024年
关键词
Transfer Learning; Reinforcement Learning; Markov Decision Process; Autoencoders; Model Transfer; Instance Transfer;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transfer learning has seen significant progress in the domains of supervised learning and reinforcement learning, yet there remains a noticeable gap in the area of regression. In supervised learning, various transfer learning methods have been developed to leverage knowledge from one task to improve performance on another, but these approaches are primarily designed for classification tasks. In the realm of reinforcement learning, many techniques focus on policy transfer, and those that do address samples transfer predominantly apply it within homogeneous contexts. This paper presents a novel algorithm that bridges the transfer learning gap between supervised learning and reinforcement learning while specifically addressing the regression problem of dynamical systems' model estimation. Our approach harnesses the feature extraction capabilities of autoencoders and the generative power of Generative Adversarial Networks (GANs) to train a mapping that facilitates the seamless transformation of samples between dynamical systems. This approach represents a significant advancement in the field of transfer learning, as it offers a versatile and effective solution for transferring regression models across heterogeneous domains. Our experimental results demonstrate the algorithm's efficacy and its potential to improve model generalization and adaptability in diverse scenarios with varying data distributions and dynamics.
引用
收藏
页码:8 / 14
页数:7
相关论文
共 34 条
[1]  
Ammar H. B., 2012, Proceedings of the 11th Interna- tional Conference on Autonomous Agents and Multiagent Systems, P383
[2]  
[Anonymous], 2007, Proceedings of the 24th international conference on Machine learning, DOI 10.1145/1273496.1273607
[3]  
Brockman G, 2016, Arxiv, DOI arXiv:1606.01540
[4]   Probabilistic Policy Reuse for inter-task transfer learning [J].
Fernandez, Fernando ;
Garcia, Javier ;
Veloso, Manuela .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2010, 58 (07) :866-871
[5]  
Garcke Jochen, 2014, Machine Learning and Knowledge Discovery in Databases. European Conference, ECML PKDD 2014. Proceedings: LNCS 8724, P466, DOI 10.1007/978-3-662-44848-9_30
[6]   Generative Adversarial Networks [J].
Goodfellow, Ian ;
Pouget-Abadie, Jean ;
Mirza, Mehdi ;
Xu, Bing ;
Warde-Farley, David ;
Ozair, Sherjil ;
Courville, Aaron ;
Bengio, Yoshua .
COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144
[7]  
Gupta A, 2017, Arxiv, DOI arXiv:1703.02949
[8]   Transfer learning techniques for medical image analysis: A review [J].
Kora, Padmavathi ;
Ooi, Chui Ping ;
Faust, Oliver ;
Raghavendra, U. ;
Gudigar, Anjan ;
Chan, Wai Yee ;
Meenakshi, K. ;
Swaraja, K. ;
Plawiak, Pawel ;
Acharya, U. Rajendra .
BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2022, 42 (01) :79-107
[9]   NONLINEAR PRINCIPAL COMPONENT ANALYSIS USING AUTOASSOCIATIVE NEURAL NETWORKS [J].
KRAMER, MA .
AICHE JOURNAL, 1991, 37 (02) :233-243
[10]  
Lazaric A., 2011, Advances in neural information processing systems, V24