A fuzzy association rule-based classifier for imbalanced classification problems

被引:32
作者
Sanz, J. [1 ]
Sesma-Sara, M.
Bustince, H.
机构
[1] Univ Publ Navarra, Dept Stat Comp Sci & Math, Campus Arrosadia S-N, Navarra 31006, Spain
关键词
Fuzzy association rule-based classifier; Imbalanced classification problems; Lift; Averaging aggregation functions; OVERLAP FUNCTIONS; EVOLUTIONARY; MULTICLASS; PROPOSAL; SYSTEMS; IDENTIFICATION; PREDICTION; ENSEMBLES; SOFTWARE; SMOTE;
D O I
10.1016/j.ins.2021.07.019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced classification problems are attracting the attention of the research community because they are prevalent in real-world problems and they impose extra difficulties for learning methods. Fuzzy rule-based classification systems have been applied to cope with these problems, mostly together with sampling techniques. In this paper, we define a new fuzzy association rule-based classifier, named FARCI, to tackle directly imbalanced classifi-cation problems. Our new proposal belongs to the algorithm modification category, since it is constructed on the basis of the state-of-the-art fuzzy classifier FARC-HD. Specifically, we modify its three learning stages, aiming at boosting the number of fuzzy rules of the minor -ity class as well as simplifying them and, for the sake of handling unequal fuzzy rule lengths, we also change the matching degree computation, which is a key step of the infer-ence process and it is also involved in the learning process. In the experimental study, we analyze the effectiveness of each one of the new components in terms of performance, F -score, and rule base size. Moreover, we also show the superiority of the new method when compared versus FARC-HD alongside sampling techniques, another algorithm mod-ification approach, two cost-sensitive methods and an ensemble. (c) 2021 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:265 / 279
页数:15
相关论文
共 50 条
[1]  
Agrawal R., P 20 INT C VERY LARG
[2]   A proposal for the genetic lateral tuning of linguistic fuzzy systems and its interaction with rule selection [J].
Alcala, Rafael ;
Alcala-Fdez, Jesus ;
Herrera, Francisco .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2007, 15 (04) :616-635
[3]   KEEL: a software tool to assess evolutionary algorithms for data mining problems [J].
Alcala-Fdez, J. ;
Sanchez, L. ;
Garcia, S. ;
del Jesus, M. J. ;
Ventura, S. ;
Garrell, J. M. ;
Otero, J. ;
Romero, C. ;
Bacardit, J. ;
Rivas, V. M. ;
Fernandez, J. C. ;
Herrera, F. .
SOFT COMPUTING, 2009, 13 (03) :307-318
[4]   A Fuzzy Association Rule-Based Classification Model for High-Dimensional Problems With Genetic Rule Selection and Lateral Tuning [J].
Alcala-Fdez, Jesus ;
Alcala, Rafael ;
Herrera, Francisco .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2011, 19 (05) :857-872
[5]  
[Anonymous], 2004, Classification and Modeling with Linguistic Information Granules: Advanced Approaches to Linguistic Data Mining
[6]   An experimental study on evolutionary fuzzy classifiers designed for managing imbalanced datasets [J].
Antonelli, Michela ;
Ducange, Pietro ;
Marcelloni, Francesco .
NEUROCOMPUTING, 2014, 146 :125-136
[7]   A Compact Evolutionary Interval-Valued Fuzzy Rule-Based Classification System for the Modeling and Prediction of Real-World Financial Applications With Imbalanced Data [J].
Antonio Sanz, Jose ;
Bernardo, Dario ;
Herrera, Francisco ;
Bustince, Humberto ;
Hagras, Hani .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2015, 23 (04) :973-990
[8]   General interval-valued overlap functions and interval-valued overlap indices [J].
Asmus, Tiago da Cruz ;
Dimuro, Gracaliz Pereira ;
Bedregal, Benjamin ;
Sanz, Jose Antonio ;
Pereira Jr, Sidnei ;
Bustince, Humberto .
INFORMATION SCIENCES, 2020, 527 :27-50
[9]   Strategies for learning in class imbalance problems [J].
Barandela, R ;
Sánchez, JS ;
García, V ;
Rangel, E .
PATTERN RECOGNITION, 2003, 36 (03) :849-851
[10]  
Batista G.E.A.P.A., 2004, ACM SIGKDD Explor. Newsl, V6, P20, DOI [DOI 10.1145/1007730.1007735, 10.1145/1007730.1007735]