Stochastic Deep Networks with Linear Competing Units for Model-Agnostic Meta-Learning

Cited by: 0
Authors
Kalais, Konstantinos [1]
Chatzis, Sotirios [1]
Affiliations
[1] Cyprus Univ Technol, Dept Elect Eng Comp Eng & Informat, Limassol, Cyprus
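Source
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 | 2022
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract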
This work addresses meta-learning (ML) by considering deep networks with stochastic local winner-takes-all (LWTA) activations. This type of network unit results in sparse representations from each model layer, as the units are organized into blocks where only one unit generates a non-zero output. The main operating principle of the introduced units relies on stochastic principles, as the network performs posterior sampling over competing units to select the winner. Therefore, the proposed networks are explicitly designed to extract input data representations of sparse stochastic nature, as opposed to the currently standard deterministic representation paradigm. Our approach produces state-of-the-art predictive accuracy on few-shot image classification and regression experiments, as well as reduced predictive error in an active learning setting; these improvements come with an immensely reduced computational cost. Code is available at: https://github.com/Kkalais/StochLWTA-ML
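The block-wise competition described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `stochastic_lwta`, the softmax over linear responses used as the winner distribution, and all shapes are assumptions chosen for illustration only. The sketch groups linear units into blocks, samples one winner per block, and zeroes out the losers, which yields a sparse, stochastic layer output.

```python
import numpy as np

def stochastic_lwta(x, weights, num_blocks, units_per_block, rng):
    """Forward pass of one stochastic LWTA layer (illustrative sketch only)."""
    # Linear responses of all competing units: (batch, num_blocks, units_per_block)
    h = (x @ weights).reshape(x.shape[0], num_blocks, units_per_block)
    # Assumed winner distribution: softmax over the units inside each block
    logits = h - h.max(axis=-1, keepdims=True)
    probs = np.exp(logits)
    probs /= probs.sum(axis=-1, keepdims=True)
    # Sample one winner index per block by inverse-CDF sampling
    u = rng.random(probs.shape[:-1] + (1,))
    winners = (probs.cumsum(axis=-1) < u).sum(axis=-1)
    winners = np.minimum(winners, units_per_block - 1)  # guard against round-off
    # One-hot mask: the winner passes its linear response, the losers output zero
    mask = np.eye(units_per_block)[winners]
    return (h * mask).reshape(x.shape[0], -1)

# Hypothetical usage: 4 inputs of dimension 8, 3 blocks of 2 competing units
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
W = rng.normal(size=(8, 6))
out = stochastic_lwta(x, W, num_blocks=3, units_per_block=2, rng=rng)
```

Because the winner is sampled rather than chosen by a deterministic maximum, repeated forward passes on the same input can activate different units, which is the stochastic representation the abstract contrasts with the deterministic paradigm.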
Pages: 10586-10597
Page count: 12