Depth-based support vector classifiers to detect data nests of rare events

Times cited: 0
Authors
Dyckerhoff, Rainer [1]
Stenz, Hartmut Jakob [1]
Affiliations
[1] Univ Cologne, Inst Econometr & Stat, Cologne, Germany
Keywords
data depth; DD-plot; Mahalanobis depth function; support vector machines; SVM; binary classification; hybrid methods; rare events; data nest; churn prediction; big data; CLASSIFICATION; MACHINES
DOI
Not available
Chinese Library Classification number
F [Economics]
Discipline classification code
02
Abstract
The aim of this project is to combine data depth with support vector machines (SVM) for binary classification. To this end, we introduce data depth functions and SVM and discuss why a combination of the two is expected to work better in some cases than SVM alone. Given two classes X and Y, the task is to decide to which of the two classes an individual data point should be assigned. Our focus lies on the detection of rare events that are structured in data nests: class X contains many more data points than class Y, and Y is less dispersed than X. This kind of classification problem is akin to finding the proverbial needle in a haystack. Data structures like these are important in churn prediction, which serves as a motivation for possible applications. Beyond the analytical investigations, comprehensive simulation studies will also be carried out.
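The data-nest setting and the depth transformation named in the keywords can be illustrated with a minimal sketch. It assumes two-dimensional data and uses the standard Mahalanobis depth, D(z | X) = 1 / (1 + (z - mu)^T Sigma^{-1} (z - mu)), with mu and Sigma estimated from the sample. The function names (mean_and_cov, mahalanobis_depth, dd_plot) and the simulated class parameters are hypothetical choices made for illustration and are not taken from the paper. The SVM step is omitted; under one plausible reading of the hybrid approach, the DD-plot coordinates computed here would serve as the features passed to an SVM.

```python
import random


def mean_and_cov(points):
    """Sample mean and unbiased covariance of a list of 2-D points."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    sxx = sum((p[0] - mx) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - my) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / (n - 1)
    return (mx, my), (sxx, sxy, syy)


def mahalanobis_depth(z, mean, cov):
    """Mahalanobis depth 1 / (1 + d^2), where d^2 is the squared
    Mahalanobis distance of z from the class mean (2-D case only)."""
    (mx, my), (sxx, sxy, syy) = mean, cov
    det = sxx * syy - sxy ** 2
    dx, dy = z[0] - mx, z[1] - my
    # Quadratic form with the closed-form inverse of a 2x2 matrix.
    d2 = (syy * dx * dx - 2.0 * sxy * dx * dy + sxx * dy * dy) / det
    return 1.0 / (1.0 + d2)


def dd_plot(points, class_x, class_y):
    """Map each point z to its DD-plot coordinates (D(z|X), D(z|Y))."""
    stats_x = mean_and_cov(class_x)
    stats_y = mean_and_cov(class_y)
    return [
        (mahalanobis_depth(z, *stats_x), mahalanobis_depth(z, *stats_y))
        for z in points
    ]


# Hypothetical data nest: a large, widely dispersed class X and a
# small, tightly concentrated class Y located inside the range of X.
rng = random.Random(0)
class_x = [(rng.gauss(0, 3), rng.gauss(0, 3)) for _ in range(2000)]
class_y = [(rng.gauss(1, 0.3), rng.gauss(1, 0.3)) for _ in range(40)]

# One query point inside the nest, one far away from it.
dd = dd_plot([(1.0, 1.0), (6.0, -5.0)], class_x, class_y)
for coords in dd:
    print(coords)
```

In this sketch, a point inside the nest is deep with respect to both classes, while a point far from the nest has a depth close to zero with respect to Y. This is the kind of separation in depth space that a classifier trained on the DD-plot could exploit.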
Pages: 107-142
Page count: 36