Fast AUC Maximization Learning Machine With Simultaneous Outlier Detection

被引:2
作者
Sun, Yichen [1 ,2 ]
Vong, Chi Man [3 ]
Wang, Shitong [1 ,2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Taihu Jiangsu Key Construct Lab IoT Applicat Tech, Wuxi 214122, Jiangsu, Peoples R China
[3] Univ Macau, Fac Sci & Technol, Dept Comp & Informat Sci, Macau, Peoples R China
基金
中国国家自然科学基金;
关键词
Support vector machines; Task analysis; Kernel; Time complexity; Anomaly detection; Upper bound; Training; AUC maximization; imbalance classification; minimum enclosing ball (MEB); outlier detection; ROC CURVE; AREA;
D O I
10.1109/TCYB.2022.3164900
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While AUC maximizing support vector machine (AUCSVM) has been developed to solve imbalanced classification tasks, its huge computational burden will make AUCSVM become impracticable and even computationally forbidden for medium or large-scale imbalanced data. In addition, minority class sometimes means extremely important information for users or is corrupted by noises and/or outliers in practical application scenarios such as medical diagnosis, which actually inspires us to generalize the AUC concept to reflect such importance or upper bound of noises or outliers. In order to address these issues, by means of both the generalized AUC metric and the core vector machine (CVM) technique, a fast AUC maximizing learning machine, called rho-AUCCVM, with simultaneous outlier detection is proposed in this study. rho-AUCCVM has its notorious merits: 1) it indeed shares the CVM's advantage, that is, asymptotically linear time complexity with respect to the total number of sample pairs, together with space complexity independent on the total number of sample pairs and 2) it can automatically determine the importance of the minority class (assuming no noise) or the upper bound of noises or outliers. Extensive experimental results about benchmarking imbalanced datasets verify the above advantages of rho-AUCCVM.
引用
收藏
页码:6843 / 6857
页数:15
相关论文
共 48 条
  • [1] Optimal core-sets for balls
    Badoiu, Mihai
    Clarkson, Kenneth L.
    [J]. COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2008, 40 (01): : 14 - 22
  • [2] An elementary proof of the Fritz-John and Karush-Kuhn-Tucker conditions in nonlinear programming
    Birbil, S. I.
    Frenk, J. B. G.
    Still, G. J.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 180 (01) : 479 - 484
  • [3] Boddupalli S., 2020, PALGRAVE ENCY SOCIAL, P1, DOI DOI 10.1109/CITISIA50690.2020.9371832
  • [4] The use of the area under the roc curve in the evaluation of machine learning algorithms
    Bradley, AP
    [J]. PATTERN RECOGNITION, 1997, 30 (07) : 1145 - 1159
  • [5] Brefeld U., 2005, P ICML WORKSH ROC AN, P1
  • [6] Practical selection of SVM parameters and noise estimation for SVM regression
    Cherkassky, V
    Ma, YQ
    [J]. NEURAL NETWORKS, 2004, 17 (01) : 113 - 126
  • [7] Large-Scale Nonlinear AUC Maximization via Triply Stochastic Gradients
    Dang, Zhiyuan
    Li, Xiang
    Gu, Bin
    Deng, Cheng
    Huang, Heng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1385 - 1398
  • [8] Demir H., 2016, MED SCI DISCOV, V3, P250
  • [9] Scalable TSK Fuzzy Modeling for Very Large Datasets Using Minimal-Enclosing-Ball Approximation
    Deng, Zhaohong
    Choi, Kup-Sze
    Chung, Fu-Lai
    Wang, Shitong
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2011, 19 (02) : 210 - 226
  • [10] Ding Y, 2015, AAAI CONF ARTIF INTE, P2568