Robust supervised classification with mixture models: Learning from data with uncertain labels

被引:87
作者
Bouveyron, Charles [1 ]
Girard, Stephane [1 ]
机构
[1] Univ Paris 01, SAMOS MATISSE, CES, UMR CNRS 8174, Pantheon Sorbonne, France
关键词
Supervised classification; Data with uncertain labels; Mixture models; Robustness; Label noise; Weakly supervised classification; DISCRIMINANT-ANALYSIS; SCALE;
D O I
10.1016/j.patcog.2009.03.027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the supervised classification framework, human supervision is required for labeling a set of learning data which are then used for building the classifier. However, in many applications, human supervision is either imprecise, difficult or expensive. In this paper, the problem of learning a Supervised multi-class classifier from data with uncertain labels is considered and a model-based classification method is proposed to solve it. The idea of the proposed method is to confront an unsupervised modeling of the data with the supervised information carried by the labels of the learning data in order to detect inconsistencies. The method is able afterward to build a robust classifier taking into account the detected inconsistencies into the labels. Experiments on artificial and real data are provided to highlight the main features of the proposed method as well as an application to object recognition Under weak Supervision. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2649 / 2658
页数:10
相关论文
共 33 条
[1]  
[Anonymous], P 18 INT C MACH LEAR
[2]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[3]   High breakdown mixture discriminant analysis [J].
Bashir, S ;
Carter, EM .
JOURNAL OF MULTIVARIATE ANALYSIS, 2005, 93 (01) :102-111
[4]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[5]   Regularized Gaussian discriminant analysis through eigenvalue decomposition [J].
Bensmail, H ;
Celeux, G .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (436) :1743-1748
[6]   High-dimensional data clustering [J].
Bouveyron, C. ;
Girard, S. ;
Schmid, C. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :502-519
[7]  
BOUVEYRON C, 2006, 5 IND C COMP VIS GRA, P457
[8]   Identifying mislabeled training data [J].
Brodley, CE ;
Friedl, MA .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1999, 11 :131-167
[9]  
BUC FD, 2006, P 1 PASCAL CHALL WOR
[10]   GAUSSIAN PARSIMONIOUS CLUSTERING MODELS [J].
CELEUX, G ;
GOVAERT, G .
PATTERN RECOGNITION, 1995, 28 (05) :781-793