One-Class Classification by Combining Density and Class Probability Estimation

被引:0
作者
Hempstalk, Kathryn [1 ]
Frank, Eibe [1 ]
Witten, Ian H. [1 ]
机构
[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART I, PROCEEDINGS | 2008年 / 5211卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One-class classification has important applications such as outlier and novelty detection. It is commonly tackled using density estimation techniques or by adapting a standard classification algorithm to the problem of carving out a decision boundary that describes the location of the target data. In this paper we investigate a simple method for one-class classification that combines the application of a density estimator, used to form a reference distribution with the induction of a standard model for class probability estimation. In this method, the reference, distribution is used to generate artificial data that is employed to form a second, artificial class. In conjunction with the target class, this artificial class is the basis for a standard two-class learning problem. We explain how the density function of the reference distribution can be combined with the class probability estimates obtained in this way to form a adjusted estimate of the density function of the target class. Using UCI datasets, and data from a typist recognition problem we show that the combined model, consisting of both a density estimator and a class probability estimator, call improve on using either component technique alone when used for one-class classification. We also compare the method to one-class classification using support vector machines.
引用
收藏
页码:505 / 519
页数:15
相关论文
共 16 条
  • [1] ABE N, 2006, P 12 ACM SIGKDD INT, P767
  • [2] Barnett V., 1994, Outliers in Statistical Data, V3rd
  • [3] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [4] Dowland PS, 2002, INT FED INFO PROC, V86, P215
  • [5] Gunetti D., 2005, ACM Transactions on Information and Systems Security, V8, P312, DOI 10.1145/1085126.1085129
  • [6] Hastie T., 2009, The Elements of Statistical Learning, P9
  • [7] Keystroke dynamics as a biometric for authentication
    Monrose, F
    Rubin, AD
    [J]. FUTURE GENERATION COMPUTER SYSTEMS, 2000, 16 (04) : 351 - 359
  • [8] Nisenson M, 2003, LECT NOTES ARTIF INT, V2838, P363
  • [9] PEARSON R, 2005, MINING INTERFACE DAT
  • [10] Tree induction for probability-based ranking
    Provost, F
    Domingos, P
    [J]. MACHINE LEARNING, 2003, 52 (03) : 199 - 215