Multi-Class Supervised Novelty Detection

被引:40
作者
Jumutc, Vilen [1 ]
Suykens, Johan A. K. [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT SCD SISTA, B-3001 Heverlee, Leuven, Belgium
关键词
Novelty detection; one-class SVM; classification; pattern recognition; labeling information; SUPPORT VECTOR MACHINES; FRAMEWORK;
D O I
10.1109/TPAMI.2014.2327984
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we study the problem of finding a support of unknown high-dimensional distributions in the presence of labeling information, called Supervised Novelty Detection (SND). The One-Class Support Vector Machine (SVM) is a widely used kernel-based technique to address this problem. However with the latter approach it is difficult to model a mixture of distributions from which the support might be constituted. We address this issue by presenting a new class of SVM-like algorithms which help to approach multi-class classification and novelty detection from a new perspective. We introduce a new coupling term between classes which leverages the problem of finding a good decision boundary while preserving the compactness of a support with the l(2)-norm penalty. First we present our optimization objective in the primal and then derive a dual QP formulation of the problem. Next we propose a Least-Squares formulation which results in a linear system which drastically reduces computational costs. Finally we derive a Pegasos-based formulation which can effectively cope with large data sets that cannot be handled by many existing QP solvers. We complete our paper with experiments that validate the usefulness and practical importance of the proposed methods both in classification and novelty detection settings.
引用
收藏
页码:2510 / 2523
页数:14
相关论文
共 32 条
  • [1] [Anonymous], 2006, Association for the Advancement of Artificial Intelligence
  • [2] Belkin M, 2006, J MACH LEARN RES, V7, P2399
  • [3] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [4] Campbell C, 2001, ADV NEUR IN, V13, P395
  • [5] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [6] The paired t test under artificial pairing
    David, HA
    Gunnink, JL
    [J]. AMERICAN STATISTICIAN, 1997, 51 (01) : 9 - 12
  • [7] Optimized fixed-size kernel models for large data sets
    De Brabanter, K.
    De Brabanter, J.
    Suykens, J. A. K.
    De Moor, B.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (06) : 1484 - 1504
  • [8] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [9] Applications of probability density estimation to the detection of abnormal conditions in engineering
    Desforges, MJ
    Jacob, PJ
    Cooper, JE
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 1998, 212 (08) : 687 - 703
  • [10] Frank A., 2010, UCI machine learning repository, V213