Hierarchical Gaussian Mixture based Task Generative Model for Robust Meta-Learning

被引:0
作者
Zhang, Yizhou [1 ]
Ni, Jingchao [2 ]
Cheng, Wei [3 ]
Chen, Zhengzhang [3 ]
Tong, Liang [4 ]
Chen, Haifeng [3 ]
Liu, Yan [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] AWS AI Labs, Seattle, WA USA
[3] NEC Labs Amer, Irving, TX USA
[4] Stellar Cyber Inc, Seoul, South Korea
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-learning enables quick adaptation of machine learning models to new tasks with limited data. While tasks could come from varying distributions in reality, most of the existing meta-learning methods consider both training and testing tasks as from the same uni-component distribution, overlooking two critical needs of a practical solution: (1) the various sources of tasks may compose a multi-component mixture distribution, and (2) novel tasks may come from a distribution that is unseen during meta-training. In this paper, we demonstrate these two challenges can be solved jointly by modeling the density of task instances. We develop a metatraining framework underlain by a novel Hierarchical Gaussian Mixture based Task Generative Model (HTGM). HTGM extends the widely used empirical process of sampling tasks to a theoretical model, which learns task embeddings, fits the mixture distribution of tasks, and enables density-based scoring of novel tasks. The framework is agnostic to the encoder and scales well with large backbone networks. The model parameters are learned end-to-end by maximum likelihood estimation via an Expectation-Maximization (EM) algorithm. Extensive experiments on benchmark datasets indicate the effectiveness of our method for both sample classification and novel task detection.
引用
收藏
页数:24
相关论文
共 58 条
  • [1] TASK2VEC: Task Embedding for Meta-Learning
    Achille, Alessandro
    Lam, Michael
    Tewari, Rahul
    Ravichandran, Avinash
    Maji, Subhransu
    Fowlkes, Charless
    Soatto, Stefano
    Perona, Pietro
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6439 - 6448
  • [2] Amos B, 2017, PR MACH LEARN RES, V70
  • [3] [Anonymous], 2018, NeurIPS
  • [4] Athey Thomas L, 2019, ARXIV190902688
  • [5] Bishop CM., 2016, PATTERN RECOGN
  • [6] Personalized Medicine: Progress and Promise
    Chan, Isaac S.
    Ginsburg, Geoffrey S.
    [J]. ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 12, 2011, 12 : 217 - 244
  • [7] Learning Deep Classifiers Consistent with Fine-Grained Novelty Detection
    Cheng, Jiacheng
    Vasconcelos, Nuno
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1664 - 1673
  • [8] Causal Understanding of Fake News Dissemination on Social Media
    Cheng, Lu
    Guo, Ruocheng
    Shu, Kai
    Liu, Huan
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 148 - 157
  • [9] Finn C, 2017, PR MACH LEARN RES, V70
  • [10] Goldberger J., 2005, P ADV NEUR INF PROC, P505