One-Shot Generalization in Deep Generative Models

Cited by: 0

Authors:

Rezende, Danilo J. [1]

Mohamed, Shakir [1]

Danihelka, Ivo [1]

Gregor, Karol [1]

Wierstra, Daan [1]

Affiliations:

[1] Google DeepMind, London, England

Source:

International Conference on Machine Learning, Vol. 48 | 2016 / Volume 48

Keywords:

BAYESIAN-INFERENCE;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Humans have an impressive ability to reason about new concepts and experiences from just a single example. In particular, humans have an ability for one-shot generalization: an ability to encounter a new concept, understand its structure, and then be able to generate compelling alternative variations of the concept. We develop machine learning systems with this important capacity by developing new deep generative models, models that combine the representational power of deep learning with the inferential power of Bayesian reasoning. We develop a class of sequential generative models that are built on the principles of feedback and attention. These two characteristics lead to generative models that are among the state of the art in density estimation and image generation. We demonstrate the one-shot generalization ability of our models using three tasks: unconditional sampling, generating new exemplars of a given concept, and generating new exemplars of a family of concepts. In all cases our models are able to generate compelling and diverse samples, having seen new examples just once, providing an important class of general-purpose models for one-shot machine learning.


Pages: 9
