The Role of Deconfounding in Meta-learning

被引:0
作者
Jiang, Yinjie [1 ]
Chen, Zhengyu [1 ]
Kuang, Kun [1 ]
Yuan, Luotian [1 ]
Ye, Xinhai [1 ]
Wang, Zhihua [2 ]
Wu, Fei [1 ,3 ]
Wei, Ying [4 ]
机构
[1] Zhejiang Univ, Dept Comp Sci & Technol, Hangzhou, Peoples R China
[2] Zhejiang Univ, Shanghai Inst Adv Study, Shanghai, Peoples R China
[3] Zhejiang Univ, Shanghai AI Lab, Shanghai, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 | 2022年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-learning has emerged as a potent paradigm for quick learning of few-shot tasks, by leveraging the meta-knowledge learned from meta-training tasks. Well-generalized meta-knowledge that facilitates fast adaptation in each task is preferred; however, recent evidence suggests the undesirable memorization effect where the meta-knowledge simply memorizing all meta-training tasks discourages task-specific adaptation and poorly generalizes. There have been several solutions to mitigating the effect, including both regularizerbased and augmentation-based methods, while a systematic understanding of these methods in a single framework is still lacking. In this paper, we offer a novel causal perspective of metalearning. Through the lens of causality, we conclude the universal label space as a confounder to be the causing factor of memorization and frame the two lines of prevailing methods as different deconfounder approaches. Remarkably, derived from the causal inference principle of front-door adjustment, we propose two frustratingly easy but effective deconfounder algorithms, i.e., sampling multiple versions of the meta-knowledge via Dropout and grouping the meta-knowledge into multiple bins. The proposed causal perspective not only brings in the two deconfounder algorithms that surpass previous works in four benchmark datasets towards combating memorization, but also opens a promising direction for metalearning.
引用
收藏
页码:10161 / 10176
页数:16
相关论文
共 37 条
  • [1] [Anonymous], 2019, Advances in neural information processing systems, DOI 10.5555/3454287.3455336
  • [2] Antoniou A., 2019, INT C LEARNING REPRE
  • [3] Bengio Y., 1991, IJCNN-91-Seattle: International Joint Conference on Neural Networks (Cat. No.91CH3049-4), DOI 10.1109/IJCNN.1991.155621
  • [4] Bengio Y., 2020, PROC INT C LEARN REP
  • [5] Finn C, 2017, PR MACH LEARN RES, V70
  • [6] Gal Y, 2016, PR MACH LEARN RES, V48
  • [7] Gordon J., 2019, INT C LEARNING REPRE
  • [8] Grant E, 2018, INT C LEARNING REPRE, P1
  • [9] Task Agnostic Meta-Learning for Few-Shot Learning
    Jamal, Muhammad Abdullah
    Qi, Guo-Jun
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11711 - 11719
  • [10] Kocaoglu M, 2018, INT C LEARNING REPRE