GEV regression with convex loss applied to imbalanced binary classification

Cited by: 3
Authors
Zhang, Haolin [1]
Liu, Gongshen [1]
Pan, Li [1]
Meng, Kui [1]
Li, Jianhua [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
Source
2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016) | 2016
Keywords
GEV; CPE; GLM; convex loss; imbalanced data;
DOI
10.1109/DSC.2016.88
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, we consider the problem of binary classification with imbalanced data. Although this problem has been studied extensively in terms of classification performance, probability estimation for both the majority and minority classes has not yet been well studied. To obtain precise class probability estimation (CPE), we propose a new regression approach that uses a recently proposed convex loss function within the generalized linear model (GLM) framework. In this model, the generalized extreme value (GEV) distribution is adopted to form the asymmetric link function, which plays the key role in binary classification with imbalanced data. We also propose a method to estimate the shape parameter of the GEV distribution. Experiments on real-world datasets show that the proposed GEV regression achieves good classification performance as well as precise CPE. In addition, comparisons with other optimization algorithms suggest that our algorithm is computationally efficient.
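The asymmetry of a GEV-based link can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the standard GEV cumulative distribution function with location 0 and scale 1, F(x) = exp(-(1 + ξx)^(-1/ξ)) for 1 + ξx > 0, and uses F directly as the response function mapping a linear predictor to a class probability. The record does not state the exact parameterization, the convex loss, or the shape-parameter estimator used in the paper, so none of those are reproduced here. The function names `gev_cdf`, `gev_response` and `logistic_response` are hypothetical.

```python
import math


def gev_cdf(x: float, xi: float) -> float:
    """Standard GEV CDF (location 0, scale 1) with shape parameter xi."""
    if abs(xi) < 1e-12:
        # Gumbel limit as xi -> 0
        return math.exp(-math.exp(-x))
    t = 1.0 + xi * x
    if t <= 0.0:
        # Outside the support: 0 below the lower bound (xi > 0),
        # 1 above the upper bound (xi < 0).
        return 0.0 if xi > 0 else 1.0
    return math.exp(-(t ** (-1.0 / xi)))


def gev_response(eta: float, xi: float) -> float:
    """Assumed response function: probability = GEV CDF of the linear predictor."""
    return gev_cdf(eta, xi)


def logistic_response(eta: float) -> float:
    """Symmetric logistic response, shown only for comparison."""
    return 1.0 / (1.0 + math.exp(-eta))


if __name__ == "__main__":
    for eta in (-1.0, 0.0, 1.0):
        print(eta, logistic_response(eta), gev_response(eta, xi=0.5))
```

The logistic response satisfies p(η) + p(-η) = 1, so it approaches 0 and 1 at the same rate. The GEV response does not have this symmetry, and its skewness is controlled by the shape parameter ξ, which is why an asymmetric link of this kind is considered for data in which one class is rare.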
Pages: 532 - 537
Number of pages: 6
Related papers
29 in total
  • [1] Bootstrapping binary GEV regressions for imbalanced datasets
    La Rocca, Michele
    Niglio, Marcella
    Restaino, Marialuisa
    COMPUTATIONAL STATISTICS, 2024, 39 (01) : 181 - 213
  • [2] A GEV-Based Classification Algorithm for Imbalanced Data
    Fu J.
    Liu G.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2018, 55 (11) : 2361 - 2371
  • [3] High-dimensional regression and classification under a class of convex loss functions
    Jiang, Yuan
    Zhang, Chunming
    STATISTICS AND ITS INTERFACE, 2013, 6 (02) : 285 - U143
  • [4] A Hybrid Approach for Binary Classification of Imbalanced Data
    Tsai, Hsinhan
    Yang, Ta-Wei
    Wong, Wai-Man
    Kao, Han-Yi
    Chou, Cheng-Fu
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2024, 23 (03)
  • [5] Meta-learning for imbalanced data and classification ensemble in binary classification
    Lin, Sung-Chiang
    Chang, Yuan-chin I.
    Yang, Wei-Ning
    NEUROCOMPUTING, 2009, 73 (1-3) : 484 - 494
  • [6] Multiclass SVM with Ramp Loss for Imbalanced Data Classification
    Phoungphol, Piyaphol
    Zhang, Yanqing
    Zhao, Yichuan
    Srichandan, Bismita
    2012 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC 2012), 2012, : 376 - 381
  • [7] Performance of asymmetric links and correction methods for imbalanced data in binary regression
    Huayanay, Alex de la Cruz
    Bazan, Jorge L.
    Cancho, Vicente G.
    Dey, Dipak K.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2019, 89 (09) : 1694 - 1714
  • [8] A Correction Method of a Base Classifier Applied to Imbalanced Data Classification
    Trajdos, Pawel
    Kurzynski, Marek
    COMPUTATIONAL SCIENCE - ICCS 2020, PT IV, 2020, 12140 : 88 - 102
  • [9] Downsampling for Binary Classification with a Highly Imbalanced Dataset Using Active Learning
    Lee, Wonjae
    Seo, Kangwon
    BIG DATA RESEARCH, 2022, 28
  • [10] Dense fuzzy support vector machine to binary classification for imbalanced data
    Wang, Qingling
    Zheng, Jian
    Zhang, Wenjing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9643 - 9653