The Role of Deconfounding in Meta-learning

被引：0

作者：

Jiang, Yinjie ^{[1
]}

Chen, Zhengyu ^{[1
]}

Kuang, Kun ^{[1
]}

Yuan, Luotian ^{[1
]}

Ye, Xinhai ^{[1
]}

Wang, Zhihua ^{[2
]}

Wu, Fei ^{[1
,3
]}

Wei, Ying ^{[4
]}

机构：

[1] Zhejiang Univ, Dept Comp Sci & Technol, Hangzhou, Peoples R China

[2] Zhejiang Univ, Shanghai Inst Adv Study, Shanghai, Peoples R China

[3] Zhejiang Univ, Shanghai AI Lab, Shanghai, Peoples R China

[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 | 2022年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Meta-learning has emerged as a potent paradigm for quick learning of few-shot tasks, by leveraging the meta-knowledge learned from meta-training tasks. Well-generalized meta-knowledge that facilitates fast adaptation in each task is preferred; however, recent evidence suggests the undesirable memorization effect where the meta-knowledge simply memorizing all meta-training tasks discourages task-specific adaptation and poorly generalizes. There have been several solutions to mitigating the effect, including both regularizerbased and augmentation-based methods, while a systematic understanding of these methods in a single framework is still lacking. In this paper, we offer a novel causal perspective of metalearning. Through the lens of causality, we conclude the universal label space as a confounder to be the causing factor of memorization and frame the two lines of prevailing methods as different deconfounder approaches. Remarkably, derived from the causal inference principle of front-door adjustment, we propose two frustratingly easy but effective deconfounder algorithms, i.e., sampling multiple versions of the meta-knowledge via Dropout and grouping the meta-knowledge into multiple bins. The proposed causal perspective not only brings in the two deconfounder algorithms that surpass previous works in four benchmark datasets towards combating memorization, but also opens a promising direction for metalearning.

引用

页码：10161 / 10176

页数：16

共 37 条

[1] [Anonymous], 2019, Advances in neural information processing systems, DOI 10.5555/3454287.3455336
[2] Antoniou A., 2019, INT C LEARNING REPRE
[3] Bengio Y., 1991, IJCNN-91-Seattle: International Joint Conference on Neural Networks (Cat. No.91CH3049-4), DOI 10.1109/IJCNN.1991.155621
[4] Bengio Y., 2020, PROC INT C LEARN REP
[5] Finn C, 2017, PR MACH LEARN RES, V70
[6] Gal Y, 2016, PR MACH LEARN RES, V48
[7] Gordon J., 2019, INT C LEARNING REPRE
[8] Grant E, 2018, INT C LEARNING REPRE, P1
[9] Task Agnostic Meta-Learning for Few-Shot Learning
Jamal, Muhammad Abdullah
Qi, Guo-Jun
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11711 - 11719
[10] Kocaoglu M, 2018, INT C LEARNING REPRE

← 1 2 3 4 →