Depth-based support vector classifiers to detect data nests of rare events

被引:0
|
作者
Dyckerhoff, Rainer [1 ]
Stenz, Hartmut Jakob [1 ]
机构
[1] Univ Cologne, Inst Econometr & Stat, Cologne, Germany
关键词
data depth; DD-plot; Mahalanobis depth function; support vector machines; SVM; binary classification; hybrid methods; rare events; data nest; churn prediction; big data; CLASSIFICATION; MACHINES;
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
The aim of this project is to combine data depth with support vector machines (SVM) for binary classification. To this end, we introduce data depth functions and SVM and discuss why a combination of the two is assumed to work better in some cases than using SVM alone. For two classes X and Y , one investigates whether an individual data point should be assigned to one of these classes. In this context, our focus lies on the detection of rare events, which are structured in data nests: class X contains much more data points than class Y and Y has less dispersion than X. This form of classification problem is akin to finding the proverbial needle in a haystack. Data structures like these are important in churn prediction analyses which will serve as a motivation for possible applications. Beyond the analytical investigations, comprehensive simulation studies will also be carried out.
引用
收藏
页码:107 / 142
页数:36
相关论文
共 50 条
  • [1] Nonparametrically consistent depth-based classifiers
    Paindaveine, Davy
    van Bever, Germain
    BERNOULLI, 2015, 21 (01) : 62 - 82
  • [2] Depth-based classification for functional data
    Lopez-Pintado, Sara
    Romo, Juan
    Data Depth: Robust Multivariate Analysis, Computational Geometry and Applications, 2006, 72 : 103 - 119
  • [3] Data depth based support vector machines for predicting corporate bankruptcy
    Kim, Sungdo
    Mun, Byeong Min
    Bae, Suk Joo
    APPLIED INTELLIGENCE, 2018, 48 (03) : 791 - 804
  • [4] Depth-based inference for functional data
    Lopez-Pintado, Sara
    Romo, Juan
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (10) : 4957 - 4968
  • [5] Depth-based classification for relational data with multiple attributes
    Zhang, Xu
    Tian, Yahui
    Guan, Guoyu
    Gel, Yulia R.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2021, 184
  • [6] Selection of Support Vector Machines based classifiers for credit risk domain
    Danenas, Paulius
    Garsva, Gintautas
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (06) : 3194 - 3204
  • [7] Robust depth-based tools for the analysis of gene expression data
    Lopez-Pintado, Sara
    Romo, Juan
    Torrente, Aurora
    BIOSTATISTICS, 2010, 11 (02) : 254 - 264
  • [8] Data depth-based nonparametric scale tests
    Chenouri, Shojaeddin
    Small, Christopher G.
    Farrar, Thomas J.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2011, 39 (02): : 356 - 369
  • [9] Credit Risk Evaluation Model Development Using Support Vector Based Classifiers
    Danenas, Paulius
    Garsva, Gintautas
    Gudas, Saulius
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 1699 - 1707
  • [10] Data Depth-Based Nonparametric Tests for Multivariate Scales
    Somanath D. Pawar
    Digambar T. Shirke
    Journal of Statistical Theory and Practice, 2022, 16