A notion of task relatedness yielding provable multiple-task learning guarantees

被引：56

作者：

Ben-David, Shai ^{[1
]}

Borbely, Reba Schuller ^{[1
]}

机构：

[1] Univ Waterloo, Cheriton Sch Comp Sci, Waterloo, ON N21 1G3, Canada

来源：

MACHINE LEARNING | 2008年 / 73卷 / 03期

关键词：

Learning theory; Multi-task learning; Classification prediction; Inductive transfer; VC-dimension; Generalization bounds; Task relatedness;

D O I：

10.1007/s10994-007-5043-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The approach of learning multiple "related" tasks simultaneously has proven quite successful in practice; however, theoretical justification for this success has remained elusive. The starting point for previous work on multiple task learning has been that the tasks to be learned jointly are somehow "algorithmically related", in the sense that the results of applying a specific learning algorithm to these tasks are assumed to be similar. We offer an alternative approach, defining relatedness of tasks on the basis of similarity between the example generating distributions that underlie these tasks. We provide a formal framework for this notion of task relatedness, which captures a sub-domain of the wide scope of issues in which one may apply a multiple task learning approach. Our notion of task similarity is relevant to a variety of real life multitask learning scenarios and allows the formal derivation of generalization bounds that are strictly stronger than the previously known bounds for both the learning-to-learn and the multitask learning scenarios. We give precise conditions under which our bounds guarantee generalization on the basis of smaller sample sizes than the standard single-task approach.

引用

页码：273 / 287

页数：15

共 11 条

[1] A model of inductive bias learning [J].

Baxter, J .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 12 :149-198

[2]

Baxter J, 1995, COLT

[3]

BENDAVID S, 2003, P 16 ANN C LEARN THE

[4]

BENDAVID S, 2002, P 8 ACM SIGKDD INT C

[5] LEARNABILITY AND THE VAPNIK-CHERVONENKIS DIMENSION [J].

BLUMER, A ;

EHRENFEUCHT, A ;

HAUSSLER, D ;

WARMUTH, MK .

JOURNAL OF THE ACM, 1989, 36 (04) :929-965

[6] Multitask learning [J].

Caruana, R .

MACHINE LEARNING, 1997, 28 (01) :41-75

[7]

HESKES T, 1998, INT C MACH LEARN, P233

[8]

INTRATOR N, 1996, CONNECTION SCI, V8

[9]

Mitchell T. M., 1998, COLT

[10]

Thrun S, 1996, ADV NEUR IN, V8, P640

← 1 2 →