Efficient Algorithms for Exact Inference in Sequence Labeling SVMs

被引：16

作者：

Bauer, Alexander ^{[1
]}

Goernitz, Nico ^{[1
]}

Biegler, Franziska ^{[1
]}

Mueller, Klaus-Robert ^{[1
,2
]}

Kloft, Marius ^{[3
,4
]}

机构：

[1] Tech Univ Berlin, Machine Learning Grp, D-10587 Berlin, Germany

[2] Korea Univ, Dept Brain & Cognit Engn, Seoul 136713, South Korea

[3] NYU, Courant Inst Math Sci, New York, NY 10012 USA

[4] Mem Sloan Kettering Canc Ctr, New York, NY 10065 USA

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2014年 / 25卷 / 05期

关键词：

Dynamic programming; gene finding; hidden Markov SVM; inference; label sequence learning; margin rescaling; slack rescaling; structural support vector machines (SVMs); structured output; HIDDEN MARKOV-MODELS;

D O I：

10.1109/TNNLS.2013.2281761

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The task of structured output prediction deals with learning general functional dependencies between arbitrary input and output spaces. In this context, two loss-sensitive formulations for maximum-margin training have been proposed in the literature, which are referred to as margin and slack rescaling, respectively. The latter is believed to be more accurate and easier to handle. Nevertheless, it is not popular due to the lack of known efficient inference algorithms; therefore, margin rescaling-which requires a similar type of inference as normal structured prediction-is the most often used approach. Focusing on the task of label sequence learning, we here define a general framework that can handle a large class of inference problems based on Hamming-like loss functions and the concept of decomposability for the underlying joint feature map. In particular, we present an efficient generic algorithm that can handle both rescaling approaches and is guaranteed to find an optimal solution in polynomial time.

引用

页码：870 / 881

页数：12

共 30 条

[1]

Altun Y, 2003, PROCEEDINGS OF THE 2003 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, P145

[2]

Altun Y., 2002, ADV NEURAL INFORM PR, V15, P977

[3]

Altun Y., 2005, ADV NEURAL INFORM PR, V18

[4]

[Anonymous], 2005, Proceedings of the 22nd international conference on Machine learning-ICML'05, DOI [10.1145/1102351.1102373, DOI 10.1145/1102351.1102373]

[5]

[Anonymous], 2003, P 20 INT C MACH LEAR

[6]

Chatterjee S., 2011, UAI, P96

[7]

Cormen TH., 2009, Introduction to Algorithms, V3

[8]

Cortes C., P ICML 2005, P153

[9]

Crammer Koby., 2001, Journal of Machine Learning Research, V2, P2001

[10]

Esposito R, 2009, J MACH LEARN RES, V10, P1851

← 1 2 3 →