A closed-form reduction of multi-class cost-sensitive learning to weighted multi-class learning

被引:17
作者
Xia, Fen [1 ]
Yang, Yan-wu [1 ]
Zhou, Liang [1 ]
Li, Fuxin [1 ]
Cai, Min [2 ]
Zeng, Daniel D. [1 ,3 ]
机构
[1] Chinese Acad Sci, Key Lab Complex Syst & Intelligence Sci, Inst Automat, Beijing 100864, Peoples R China
[2] Beijing Jaiotong Univ, Dept Comp Sci, Beijing, Peoples R China
[3] Univ Arizona, Dept Management Informat Syst, Tucson, AZ 85721 USA
基金
美国国家科学基金会;
关键词
Cost-sensitive learning; Supervised learning; Statistical learning theory; Classification;
D O I
10.1016/j.patcog.2008.12.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
in cost-sensitive learning, misclassification costs can vary for different classes. This paper investigates an approach reducing a multi-class cost-sensitive learning to a standard classification task based on the data space expansion technique developed by Abe et al., which coincides with Elkan's reduction with respect to binary classification tasks. Using this proposed reduction approach, a cost-sensitive learning problem can be solved by considering a standard 0/1 loss classification problem on a new distribution determined by the cost matrix. We also propose a new weighting mechanism to solve the reduced standard classification problem, based on a theorem stating that the empirical loss on independently identically distributed samples from the new distribution is essentially the same as the loss on the expanded weighted training set. Experimental results on several synthetic and benchmark datasets show that our weighting approach is more effective than existing representative approaches for cost-sensitive learning. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1572 / 1581
页数:10
相关论文
共 24 条
[1]  
Abe N., 2004, P 10 ACM SIGKDD INT, P3
[2]  
[Anonymous], P ACM SIGKDD INT C K
[3]  
BAY SD, 2000, UCI KDD ARCH DEP INF
[4]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[5]   A MEASURE OF ASYMPTOTIC EFFICIENCY FOR TESTS OF A HYPOTHESIS BASED ON THE SUM OF OBSERVATIONS [J].
CHERNOFF, H .
ANNALS OF MATHEMATICAL STATISTICS, 1952, 23 (04) :493-507
[6]  
Domingos P., 1999, P ACM SIGKDD INT C K, P155, DOI DOI 10.1145/312129.312220
[7]  
Drummond C., 2003, WORKING NOTES ICML 0
[8]  
Elkan C., 2001, P INT JOINT C ART IN, P973, DOI DOI 10.5555/1642194.1642224
[9]  
Hastie T., 2009, The elements of statistical learning: data mining, inference, and prediction, P9
[10]  
LANGFORD J, 2005, 18 ANN C LEARN THEOR