Learning latent variable structured prediction models with Gaussian perturbations

Cited by: 0
Authors
Bello, Kevin [1 ]
Honorio, Jean [1 ]
Affiliation
[1] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018 / Vol. 31
Funding
U.S. National Science Foundation
Keywords
NUMBER
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
Standard margin-based structured prediction commonly uses a maximum loss over all possible structured outputs [26, 1, 5, 25]. The large-margin formulation including latent variables [30, 21] not only results in a non-convex problem but also enlarges the search space by a factor of the size of the latent space. Recent work [11] proposed using the maximum loss over random structured outputs sampled independently from some proposal distribution, with theoretical guarantees. We extend this work by including latent variables. We study a new family of loss functions under Gaussian perturbations and analyze the effect of the latent space on the generalization bounds. We show that the non-convexity of learning with latent variables arises naturally, as it relates to a tight upper bound of the Gibbs decoder distortion with respect to the latent space. Finally, we provide a formulation using random samples and relaxations that produces a tighter upper bound of the Gibbs decoder distortion up to a statistical accuracy, which enables a polynomial-time evaluation of the objective function. We illustrate the method with synthetic experiments and a computer vision application.
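As a rough illustration of the idea sketched in the abstract, the following Python fragment estimates a margin-based loss by maximizing over a handful of structured outputs drawn from a proposal distribution, rather than over all possible outputs, with latent variables maximized over and the parameters perturbed by Gaussian noise. This is a minimal sketch based only on the abstract: the names `feature_map`, `distortion`, `propose_output`, and `latent_space` are hypothetical placeholders, not the paper's actual notation or code.

```python
import numpy as np

rng = np.random.default_rng(0)

def score(w, x, y, h, feature_map):
    # Linear score <w, phi(x, y, h)>, where h is a latent variable.
    return w @ feature_map(x, y, h)

def sampled_max_loss(w, x, y_true, latent_space, feature_map,
                     distortion, propose_output, n_samples=10, sigma=0.1):
    # Gaussian perturbation of the parameters, as in the abstract.
    w_pert = w + sigma * rng.standard_normal(w.shape)
    # Best latent completion of the ground-truth output; this inner max
    # over the latent space is the source of the non-convexity the
    # abstract refers to.
    s_true = max(score(w_pert, x, y_true, h, feature_map)
                 for h in latent_space)
    # Maximum distortion-weighted margin violation over a few RANDOM
    # structured outputs, instead of a max over all possible outputs.
    loss = 0.0
    for _ in range(n_samples):
        y_rand = propose_output(x)  # sample from the proposal distribution
        s_rand = max(score(w_pert, x, y_rand, h, feature_map)
                     for h in latent_space)
        if s_rand >= s_true:  # the sampled output violates the margin
            loss = max(loss, distortion(y_true, y_rand))
    return loss
```

Because each evaluation touches only `n_samples` sampled outputs and the latent space, rather than the exponentially large output space, the objective can be evaluated in polynomial time, which is the computational point the abstract makes.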
Pages: 11