Robust Learning under Uncertain Test Distributions: Relating Covariate Shift to Model Misspecification

被引:0
作者
Wen, Junfeng [1 ]
Yu, Chun-Nam [2 ]
Greiner, Russell [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
[2] Bell Labs, Alcatel Lucent, Murray Hill, NJ 07974 USA
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2) | 2014年 / 32卷
关键词
MAXIMUM-ENTROPY; INFERENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many learning situations involve learning the conditional distribution p(y vertical bar x) when the training instances are drawn from the training distribution p(tr) (x), even though it will later be used to predict for instances drawn from a different test distribution p(te) (x). Most current approaches focus on learning how to reweigh the training examples, to make them resemble the test distribution. However, reweighing does not always help, because (we show that) the test error also depends on the correctness of the underlying model class. This paper analyses this situation by viewing the problem of learning under changing distributions as a game between a learner and an adversary. We characterize when such reweighing is needed, and also provide an algorithm, robust covariate shift adjustment (RCSA), that provides relevant weights. Our empirical studies, on UCI datasets and a real-world cancer prognostic prediction dataset, show that our analysis applies, and that our RCSA works effectively.
引用
收藏
页码:631 / 639
页数:9
相关论文
共 26 条
[1]   Unifying divergence minimization and statistical inference via convex duality [J].
Altun, Yasemin ;
Smola, Alex .
LEARNING THEORY, PROCEEDINGS, 2006, 4005 :139-153
[2]  
[Anonymous], 2006, P 23 INT C MACHINE, DOI DOI 10.1145/1143844.1143889
[3]  
[Anonymous], 2008, Advances in Neural Information Processing Systems
[4]  
[Anonymous], 2007, Advances in Neural Information Processing Systems
[5]  
[Anonymous], 2007, Proceedings of the 20th International Conference on Neural Information Processing Systems
[6]  
Ben-David Shai, 2007, NIPS
[7]  
Berger AL, 1996, COMPUT LINGUIST, V22, P39
[8]  
Bickel S., 2007, P 24 INT C MACHINE L, P81, DOI DOI 10.1145/1273496.1273507
[9]  
Bickel S, 2009, J MACH LEARN RES, V10, P2137
[10]  
Cortes C, 2008, LECT NOTES ARTIF INT, V5254, P38, DOI 10.1007/978-3-540-87987-9_8