F-measure maximizing logistic regression

被引:0
|
作者
Okabe, Masaaki [1 ]
Tsuchida, Jun [1 ]
Yadohisa, Hiroshi [1 ]
机构
[1] Doshisha Univ, Grad Sch Culture & Informat Sci, Kyoto, Japan
关键词
Density ratio; Discriminant analysis; Imbalanced data; Weighted importance; ROC;
D O I
10.1080/03610918.2022.2081706
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Logistic regression is a widely used method in several fields. When applying logistic regression to imbalanced data, wherein the majority classes dominate the minority classes, all class labels are estimated as "majority class." In this study, we use an F-measure optimization method to improve the performance of logistic regression applied to imbalanced data. Although many F-measure optimization methods adopt a ratio of the estimators to approximate the F-measure, the ratio of the estimators tends to exhibit more bias than when the ratio is directly approximated. Therefore, we employ an approximate F-measure to estimate the relative density ratio. In addition, we define and approximate a relative F-measure. We present an algorithm for a logistic regression weighted approximation relative to the F-measure. The results of an experiment using real world data demonstrate that our proposed algorithm can efficiently improve the performance of logistic regression applied to imbalanced data.
引用
收藏
页码:2554 / 2564
页数:11
相关论文
共 50 条
  • [21] Optimizing the Multiclass F-measure via Biconcave Programming
    Pan, Weiwei
    Narasimhan, Harikrishna
    Kar, Purushottam
    Protopapas, Pavlos
    Ramaswamy, Harish G.
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1101 - 1106
  • [22] Adjusted F-measure and kernel scaling for imbalanced data learning
    Maratea, Antonio
    Petrosino, Alfredo
    Manzo, Mario
    INFORMATION SCIENCES, 2014, 257 : 331 - 341
  • [23] F-Measure Curves for Visualizing Classifier Performance with Imbalanced Data
    Soleymani, Roghayeh
    Granger, Eric
    Fumera, Giorgio
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2018, 2018, 11081 : 165 - 177
  • [24] F-measure curves: A tool to visualize classifier performance under imbalance
    Soleymani, Roghayeh
    Granger, Eric
    Fumera, Giorgio
    PATTERN RECOGNITION, 2020, 100
  • [25] Extreme F-Measure Maximization using Sparse Probability Estimates
    Jasinska, Kalina
    Dembczynski, Krzysztof
    Busa-Fekete, Robert
    Pfannschmidt, Karlson
    Klerx, Timo
    Huellermeier, Eyke
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [26] A note on using the F-measure for evaluating record linkage algorithms
    Hand, David
    Christen, Peter
    STATISTICS AND COMPUTING, 2018, 28 (03) : 539 - 547
  • [27] Cost-Sensitive Hypergraph Learning With F-Measure Optimization
    Wang, Nan
    Liang, Ruozhou
    Zhao, Xibin
    Gao, Yue
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (05) : 2767 - 2778
  • [28] Efficient Optimization of F-Measure with Cost-Sensitive SVM
    Cheng, Fan
    Zhou, Yuan
    Gao, Jian
    Zheng, Shuangqiu
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2016, 2016
  • [29] A Review of the F-Measure: Its History, Properties, Criticism, and Alternatives
    Christen, Peter
    Hand, David J.
    Kirielle, Nishadi
    ACM COMPUTING SURVEYS, 2024, 56 (03)
  • [30] PROFIT MAXIMIZING LOGISTIC REGRESSION MODELING FOR CREDIT SCORING
    Devos, Arnout
    Dhondt, Jakob
    Stripling, Eugen
    Baesens, Bart
    vanden Broucke, Seppe Klm
    Sukhatme, Gaurav
    2018 IEEE DATA SCIENCE WORKSHOP (DSW), 2018, : 125 - 129