Specialization in Hierarchical Learning SystemsA Unified Information-theoretic Approach for Supervised, Unsupervised and Reinforcement Learning

被引:0
作者
Heinke Hihn
Daniel A. Braun
机构
[1] Ulm University,Institute for Neural Information Processing
来源
Neural Processing Letters | 2020年 / 52卷
关键词
Meta-learning; Information theory; Bounded rationality;
D O I
暂无
中图分类号
学科分类号
摘要
Joining multiple decision-makers together is a powerful way to obtain more sophisticated decision-making systems, but requires to address the questions of division of labor and specialization. We investigate in how far information constraints in hierarchies of experts not only provide a principled method for regularization but also to enforce specialization. In particular, we devise an information-theoretically motivated on-line learning rule that allows partitioning of the problem space into multiple sub-problems that can be solved by the individual experts. We demonstrate two different ways to apply our method: (i) partitioning problems based on individual data samples and (ii) based on sets of data samples representing tasks. Approach (i) equips the system with the ability to solve complex decision-making problems by finding an optimal combination of local expert decision-makers. Approach (ii) leads to decision-makers specialized in solving families of tasks, which equips the system with the ability to solve meta-learning problems. We show the broad applicability of our approach on a range of problems including classification, regression, density estimation, and reinforcement learning problems, both in the standard machine learning setup and in a meta-learning setting.
引用
收藏
页码:2319 / 2352
页数:33
相关论文
共 100 条
[1]  
Arulkumaran K(2017)Deep reinforcement learning: a brief survey IEEE Signal Process Mag 34 26-38
[2]  
Deisenroth MP(2019)Robust support vector regression in primal with asymmetric huber loss Neural Process Lett 49 1399-1431
[3]  
Brundage M(1989)Unsupervised learning Neural Comput 1 295-311
[4]  
Bharath AA(2000)Assessing a mixture model for clustering with the integrated completed likelihood IEEE Trans Pattern Anal Mach Intell 22 719-725
[5]  
Balasundaram S(2010)Structure learning in action Behav Brain Res 206 157-165
[6]  
Meena Y(1997)Multitask learning Mach Learn 28 41-75
[7]  
Barlow HB(2014)One and done? Optimal decisions from very few samples Cognit Sci 38 599-637
[8]  
Biernacki C(2015)Structure learning in bayesian sensorimotor integration PLoS Comput Biol 11 e1004369-278
[9]  
Celeux G(2015)Bounded rationality, abstraction, and hierarchical decision-making: an information-theoretic optimality principle Front Robot AI 2 27-143
[10]  
Govaert G(2015)Computational rationality: a converging paradigm for intelligence in brains, minds, and machines Science 349 273-476