One-Class Classification by Combining Density and Class Probability Estimation

被引：0

作者：

Hempstalk, Kathryn ^{[1
]}

Frank, Eibe ^{[1
]}

Witten, Ian H. ^{[1
]}

机构：

[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART I, PROCEEDINGS | 2008年 / 5211卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

One-class classification has important applications such as outlier and novelty detection. It is commonly tackled using density estimation techniques or by adapting a standard classification algorithm to the problem of carving out a decision boundary that describes the location of the target data. In this paper we investigate a simple method for one-class classification that combines the application of a density estimator, used to form a reference distribution with the induction of a standard model for class probability estimation. In this method, the reference, distribution is used to generate artificial data that is employed to form a second, artificial class. In conjunction with the target class, this artificial class is the basis for a standard two-class learning problem. We explain how the density function of the reference distribution can be combined with the class probability estimates obtained in this way to form a adjusted estimate of the density function of the target class. Using UCI datasets, and data from a typist recognition problem we show that the combined model, consisting of both a density estimator and a class probability estimator, call improve on using either component technique alone when used for one-class classification. We also compare the method to one-class classification using support vector machines.

引用

页码：505 / 519

页数：15

共 16 条

[1] ABE N, 2006, P 12 ACM SIGKDD INT, P767
[2] Barnett V., 1994, Outliers in Statistical Data, V3rd
[3] LIBSVM: A Library for Support Vector Machines
Chang, Chih-Chung
Lin, Chih-Jen
[J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[4] Dowland PS, 2002, INT FED INFO PROC, V86, P215
[5] Gunetti D., 2005, ACM Transactions on Information and Systems Security, V8, P312, DOI 10.1145/1085126.1085129
[6] Hastie T., 2009, The Elements of Statistical Learning, P9
[7] Keystroke dynamics as a biometric for authentication
Monrose, F
Rubin, AD
[J]. FUTURE GENERATION COMPUTER SYSTEMS, 2000, 16 (04) : 351 - 359
[8] Nisenson M, 2003, LECT NOTES ARTIF INT, V2838, P363
[9] PEARSON R, 2005, MINING INTERFACE DAT
[10] Tree induction for probability-based ranking
Provost, F
Domingos, P
[J]. MACHINE LEARNING, 2003, 52 (03) : 199 - 215

← 1 2 →