Hierarchical Gaussian Mixture based Task Generative Model for Robust Meta-Learning

Cited by: 0

Authors:

Zhang, Yizhou [1]

Ni, Jingchao [2]

Cheng, Wei [3]

Chen, Zhengzhang [3]

Tong, Liang [4]

Chen, Haifeng [3]

Liu, Yan [1]

Affiliations:

[1] Univ Southern Calif, Los Angeles, CA 90007 USA

[2] AWS AI Labs, Seattle, WA USA

[3] NEC Labs Amer, Irving, TX USA

[4] Stellar Cyber Inc, Seoul, South Korea

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Meta-learning enables quick adaptation of machine learning models to new tasks with limited data. While tasks could come from varying distributions in reality, most existing meta-learning methods assume that training and testing tasks come from the same uni-component distribution, overlooking two critical needs of a practical solution: (1) the various sources of tasks may compose a multi-component mixture distribution, and (2) novel tasks may come from a distribution that is unseen during meta-training. In this paper, we demonstrate that these two challenges can be solved jointly by modeling the density of task instances. We develop a meta-training framework built on a novel Hierarchical Gaussian Mixture based Task Generative Model (HTGM). HTGM extends the widely used empirical process of sampling tasks to a theoretical model, which learns task embeddings, fits the mixture distribution of tasks, and enables density-based scoring of novel tasks. The framework is agnostic to the encoder and scales well with large backbone networks. The model parameters are learned end-to-end by maximum likelihood estimation via an Expectation-Maximization (EM) algorithm. Extensive experiments on benchmark datasets indicate the effectiveness of our method for both sample classification and novel task detection.
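The general idea of fitting a mixture density to task embeddings with EM and then scoring unseen tasks by their density can be illustrated with a small sketch. This is not HTGM itself: it is a flat (non-hierarchical) Gaussian mixture with isotropic covariances, fitted on synthetic 2-D vectors that merely stand in for learned task embeddings. The function names (`fit_gmm`, `log_density`), the initialization, and the novelty threshold are all assumptions made for illustration.

```python
import math
import random


def log_gauss(x, mean, var):
    """Log density of an isotropic Gaussian N(mean, var * I) at point x."""
    d = len(x)
    sq_dist = sum((xi - mi) ** 2 for xi, mi in zip(x, mean))
    return -0.5 * (d * math.log(2 * math.pi * var) + sq_dist / var)


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def log_density(x, weights, means, variances):
    """Log of the mixture density sum_k pi_k * N(x | mu_k, var_k * I)."""
    return logsumexp([math.log(w) + log_gauss(x, m, v)
                      for w, m, v in zip(weights, means, variances)])


def fit_gmm(points, init_means, n_iter=50, min_var=1e-6):
    """Fit an isotropic Gaussian mixture by EM (illustrative, not HTGM)."""
    k, n, d = len(init_means), len(points), len(points[0])
    weights = [1.0 / k] * k
    means = [list(m) for m in init_means]
    variances = [1.0] * k
    for _ in range(n_iter):
        # E-step: posterior responsibility of each component for each point.
        resp = []
        for x in points:
            logs = [math.log(w) + log_gauss(x, m, v)
                    for w, m, v in zip(weights, means, variances)]
            norm = logsumexp(logs)
            resp.append([math.exp(l - norm) for l in logs])
        # M-step: re-estimate weights, means, and variances.
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-12)  # guard empty component
            weights[j] = nj / n
            means[j] = [sum(r[j] * x[t] for r, x in zip(resp, points)) / nj
                        for t in range(d)]
            sq = sum(r[j] * sum((x[t] - means[j][t]) ** 2 for t in range(d))
                     for r, x in zip(resp, points))
            variances[j] = max(sq / (nj * d), min_var)
    return weights, means, variances


def sample(center, n, std=0.5):
    """Draw n synthetic 'task embeddings' around a cluster center."""
    return [[random.gauss(c, std) for c in center] for _ in range(n)]


random.seed(0)
# Two synthetic task clusters standing in for two sources of tasks.
points = sample((0.0, 0.0), 100) + sample((5.0, 5.0), 100)
weights, means, variances = fit_gmm(points, init_means=[points[0], points[-1]])

# Assumed novelty rule: flag anything less likely than every training point.
threshold = min(log_density(x, weights, means, variances) for x in points)
seen_score = log_density([0.2, -0.1], weights, means, variances)
novel_score = log_density([20.0, -20.0], weights, means, variances)
is_novel = novel_score < threshold
```

In this sketch a point far from both clusters receives a much lower log-density than a point near a cluster center, which is the basic mechanism behind density-based novelty scoring. The paper's model additionally learns the embeddings end-to-end and uses a hierarchical mixture, neither of which is reproduced here.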


Pages: 24

