Large-Scale Fuzzy Least Squares Twin SVMs for Class Imbalance Learning

Cited by: 36
Authors
Ganaie, M. A. [1]
Tanveer, M. [1]
Lin, Chin-Teng [2]
Affiliations
[1] Indian Inst Technol Indore, Dept Math, Indore 453552, India
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Artificial Intelligence, Ultimo, NSW 2007, Australia
Funding
National Institutes of Health (USA);
Keywords
Support vector machines; Kernel; Risk management; Minimization; Computational modeling; Alzheimer's disease; Data models; Alzheimer's disease (AD); class imbalance; machine learning; magnetic resonance imaging (MRI); maximum margin; mild cognitive impairment (MCI); pattern classification; structural risk minimization (SRM) principle; support vector machines (SVMs); twin support vector machine (TSVM); SUPPORT VECTOR MACHINES; CLASSIFICATION;
DOI
10.1109/TFUZZ.2022.3161729
CLC number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Twin support vector machines (TSVMs) have been successfully employed for binary classification problems. As data have proliferated alongside machine learning algorithms, there is a growing need to process large-scale data. TSVMs handle large-scale data poorly for three reasons: 1) the TSVM optimization problem requires computing large matrix inverses, which makes it inefficient for large-scale problems; 2) the TSVM employs the empirical risk minimization principle and hence may overfit; and 3) the Wolfe dual of the TSVM formulation involves positive-semidefinite matrices, so singularity issues must be resolved manually. To address these shortcomings, in this article we propose a novel large-scale fuzzy least squares TSVM for class imbalance learning (LS-FLSTSVM-CIL). We formulate the LS-FLSTSVM-CIL so that: 1) no matrix inversion is involved, which makes it an efficient choice for large-scale problems; 2) the structural risk minimization principle is implemented, which avoids overfitting issues and results in better performance; and 3) the Wolfe dual formulation involves positive-definite matrices. In addition, to address class imbalance, we assign fuzzy weights to the samples so that the model is not biased toward the dominant (majority) class. To make the model more feasible for large-scale problems, we solve its objective function with an iterative procedure known as the sequential minimization principle. Experimental results show that the proposed LS-FLSTSVM-CIL outperforms the baseline classifiers.
To demonstrate the feasibility of the proposed LS-FLSTSVM-CIL on large-scale classification problems, we evaluate the classification models on the large-scale normally distributed clustered (NDC) dataset. To demonstrate its practical applicability, we evaluate the model on the diagnosis of Alzheimer's disease and breast cancer. The NDC evaluation shows that the proposed LS-FLSTSVM-CIL is feasible for large-scale problems, as it is faster than the baseline classifiers.
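The abstract does not state which fuzzy membership function the model uses, so the following is only a minimal sketch of the general idea of weighting samples to counter class imbalance. It assumes a simple scheme that is not taken from the paper: minority-class samples keep weight 1, and majority-class samples are scaled by the ratio of minority to majority counts. The function name `imbalance_fuzzy_weights` is hypothetical.

```python
from collections import Counter


def imbalance_fuzzy_weights(labels):
    """Assumed scheme (not the paper's): weight 1.0 for minority-class
    samples and n_minority / n_majority for majority-class samples."""
    counts = Counter(labels)
    if len(counts) != 2:
        raise ValueError("expected exactly two classes")
    # most_common returns (label, count) pairs sorted by descending count
    (maj_label, n_maj), (min_label, n_min) = counts.most_common(2)
    ratio = n_min / n_maj
    return [1.0 if y == min_label else ratio for y in labels]


# Toy labels: 2 minority samples (+1) and 8 majority samples (-1)
labels = [1, -1, -1, -1, -1, 1, -1, -1, -1, -1]
weights = imbalance_fuzzy_weights(labels)

# Total weight carried by each class after reweighting
minority_total = sum(w for w, y in zip(weights, labels) if y == 1)
majority_total = sum(w for w, y in zip(weights, labels) if y == -1)
```

Under this assumed scheme both classes carry the same total weight in the loss, so the majority class no longer dominates the penalty term purely through its size.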
Pages: 4815-4827
Number of pages: 13