Specialization in Hierarchical Learning SystemsA Unified Information-theoretic Approach for Supervised, Unsupervised and Reinforcement Learning

被引：0

作者：

Heinke Hihn

Daniel A. Braun

机构：

[1] Ulm University,Institute for Neural Information Processing

来源：

Neural Processing Letters | 2020年 / 52卷

关键词：

Meta-learning; Information theory; Bounded rationality;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Joining multiple decision-makers together is a powerful way to obtain more sophisticated decision-making systems, but requires to address the questions of division of labor and specialization. We investigate in how far information constraints in hierarchies of experts not only provide a principled method for regularization but also to enforce specialization. In particular, we devise an information-theoretically motivated on-line learning rule that allows partitioning of the problem space into multiple sub-problems that can be solved by the individual experts. We demonstrate two different ways to apply our method: (i) partitioning problems based on individual data samples and (ii) based on sets of data samples representing tasks. Approach (i) equips the system with the ability to solve complex decision-making problems by finding an optimal combination of local expert decision-makers. Approach (ii) leads to decision-makers specialized in solving families of tasks, which equips the system with the ability to solve meta-learning problems. We show the broad applicability of our approach on a range of problems including classification, regression, density estimation, and reinforcement learning problems, both in the standard machine learning setup and in a meta-learning setting.

引用

页码：2319 / 2352

页数：33

共 100 条

[1]

Arulkumaran K(2017)Deep reinforcement learning: a brief survey IEEE Signal Process Mag 34 26-38

[2]

Deisenroth MP(2019)Robust support vector regression in primal with asymmetric huber loss Neural Process Lett 49 1399-1431

[3]

Brundage M(1989)Unsupervised learning Neural Comput 1 295-311

[4]

Bharath AA(2000)Assessing a mixture model for clustering with the integrated completed likelihood IEEE Trans Pattern Anal Mach Intell 22 719-725

[5]

Balasundaram S(2010)Structure learning in action Behav Brain Res 206 157-165

[6]

Meena Y(1997)Multitask learning Mach Learn 28 41-75

[7]

Barlow HB(2014)One and done? Optimal decisions from very few samples Cognit Sci 38 599-637

[8]

Biernacki C(2015)Structure learning in bayesian sensorimotor integration PLoS Comput Biol 11 e1004369-278

[9]

Celeux G(2015)Bounded rationality, abstraction, and hierarchical decision-making: an information-theoretic optimality principle Front Robot AI 2 27-143

[10]

Govaert G(2015)Computational rationality: a converging paradigm for intelligence in brains, minds, and machines Science 349 273-476

← 1 2 3 4 5 6 7 8 9 10 →