Continual Learning of Generative Models With Limited Data: From Wasserstein-1 Barycenter to Adaptive Coalescence

被引:0
|
作者
Dedeoglu, Mehmet [1 ]
Lin, Sen [2 ]
Zhang, Zhaofeng [1 ]
Zhang, Junshan [3 ]
机构
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA
[2] Ohio State Univ, AI EDGE Inst, Columbus, OH 43210 USA
[3] Univ Calif Davis, Dept Elect & Comp Engn, Davis, CA 95616 USA
关键词
Adaptation models; Data models; Computational modeling; Optimization; Solid modeling; Task analysis; Servers; Continual learning; generative adversarial networks (GANs); optimal transport theory; Wasserstein barycenters; OPTIMAL-TRANSPORT;
D O I
10.1109/TNNLS.2023.3251096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning generative models is challenging for a network edge node with limited data and computing power. Since tasks in similar environments share a model similarity, it is plausible to leverage pretrained generative models from other edge nodes. Appealing to optimal transport theory tailored toward Wasserstein-1 generative adversarial networks (WGANs), this study aims to develop a framework that systematically optimizes continual learning of generative models using local data at the edge node while exploiting adaptive coalescence of pretrained generative models. Specifically, by treating the knowledge transfer from other nodes as Wasserstein balls centered around their pretrained models, continual learning of generative models is cast as a constrained optimization problem, which is further reduced to a Wasserstein-1 barycenter problem. A two-stage approach is devised accordingly: 1) the barycenters among the pretrained models are computed offline, where displacement interpolation is used as the theoretic foundation for finding adaptive barycenters via a "recursive" WGAN configuration and 2) the barycenter computed offline is used as metamodel initialization for continual learning, and then, fast adaptation is carried out to find the generative model using the local samples at the target edge node. Finally, a weight ternarization method, based on joint optimization of weights and threshold for quantization, is developed to compress the generative model further. Extensive experimental studies corroborate the effectiveness of the proposed framework.
引用
收藏
页码:12042 / 12056
页数:15
相关论文
共 13 条
  • [1] Generative Models from the perspective of Continual Learning
    Lesort, Timothee
    Caselles-Dupre, Hugo
    Garcia-Ortiz, Michael
    Stoian, Andrei
    Filliat, David
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [2] Adaptive Feature Generation for Online Continual Learning from Imbalanced Data
    Jian, Yingchun
    Yi, Jinfeng
    Zhang, Lijun
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 276 - 289
  • [3] Learning Generative Models for Climbing Aircraft from Radar Data
    Pepper, Nick
    Thomas, Marc
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2024, 21 (06): : 474 - 481
  • [4] Synthetic observations from deep generative models and binary omics data with limited sample size
    Nussberger, Jens
    Boesel, Frederic
    Lenz, Stefan
    Binder, Harald
    Hess, Moritz
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [5] A virtual sample generation method based on manifold learning and a generative adversarial network for soft sensor models with limited data
    Bai, Xinpeng
    Li, Shaojun
    JOURNAL OF THE TAIWAN INSTITUTE OF CHEMICAL ENGINEERS, 2023, 151
  • [6] Training deep-learning segmentation models from severely limited data
    Zhao, Yao
    Rhee, Dong Joo
    Cardenas, Carlos
    Court, Laurence E.
    Yang, Jinzhong
    MEDICAL PHYSICS, 2021, 48 (04) : 1697 - 1706
  • [7] Learning Adaptive Forecasting Models from Irregularly Sampled Multivariate Clinical Data
    Liu, Zitao
    Hauskrecht, Milos
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1273 - 1279
  • [8] Discovering generative models from event logs: data-driven simulation vs deep learning
    Camargo, Manuel
    Dumas, Marlon
    Gonzalez-Rojas, Oscar
    PEERJ COMPUTER SCIENCE, 2021, 7
  • [9] Learning Predictive Models from Integrated Healthcare Data: Extending Pattern-based and Generative Models to Capture Temporal and Cross-Attribute Dependencies
    Henriques, Rui
    Antunes, Claudia
    2014 47TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2014, : 2562 - 2569
  • [10] From Synthetic To Real: Enhancing Deep Learning Models With Generative Adversarial Networks For Efficient Data Utilization In Automatic Retail Stores
    Cong-Ty Dang
    Vu-Hoang Tran
    Ngoc-Hoang-Lam Le
    Huang, Ching-Chun
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1726 - 1731