A hierarchical genetic fuzzy system based on genetic programming for addressing classification with highly imbalanced and borderline data-sets

Cited by: 69
Authors
Lopez, Victoria [1]
Fernandez, Alberto [2]
Jose del Jesus, Maria [2]
Herrera, Francisco [1]
Affiliations
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Res Ctr Informat & Commun Technol, CITIC,UGR, E-18071 Granada, Spain
[2] Univ Jaen, Dept Comp Sci, Jaen 23071, Spain
Keywords
Fuzzy rule based classification systems; Hierarchical fuzzy partitions; Genetic rule selection; Tuning; Imbalanced data-sets; Borderline examples; SOFTWARE TOOL; ALGORITHMS; PROPOSAL; RECOGNITION; PERFORMANCE; ACCURACY; TAXONOMY; KEEL;
DOI
10.1016/j.knosys.2012.08.025
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Lots of real world applications appear to be a matter of classification with imbalanced data-sets. This problem arises when the number of instances from one class is quite different to the number of instances from the other class. Traditionally, classification algorithms are unable to correctly deal with this issue as they are biased towards the majority class. Therefore, algorithms tend to misclassify the minority class which usually is the most interesting one for the application that is being sorted out. Among the available learning approaches, fuzzy rule-based classification systems have obtained a good behavior in the scenario of imbalanced data-sets. In this work, we focus on some modifications to further improve the performance of these systems considering the usage of information granulation. Specifically, a positive synergy between data sampling methods and algorithmic modifications is proposed, creating a genetic programming approach that uses linguistic variables in a hierarchical way. These linguistic variables are adapted to the context of the problem with a genetic process that combines rule selection with the adjustment of the lateral position of the labels based on the 2-tuples linguistic model. An experimental study is carried out over highly imbalanced and borderline imbalanced data-sets which is completed by a statistical comparative analysis. The results obtained show that the proposed model outperforms several fuzzy rule based classification systems, including a hierarchical approach and presents a better behavior than the C4.5 decision tree. (c) 2012 Elsevier B.V. All rights reserved.
Pages: 85-104
Number of pages: 20
Related papers
50 in total
  • [21] Data classification by a fuzzy genetic system approach
    Espíndola, RP
    Ebecken, NFF
    DATA MINING IV, 2004, 7 : 245 - 254
  • [22] Stochastic Semantic-Based Multi-objective Genetic Programming Optimisation for Classification of Imbalanced Data
    Galvan-Lopez, Edgar
    Vazquez-Mendoza, Lucia
    Trujillo, Leonardo
    ADVANCES IN SOFT COMPUTING, MICAI 2016, PT II, 2017, 10062 : 261 - 272
  • [23] Genetic programming based data projections for classification tasks
    Estébanez, C
    Aler, R
    Valls, JM
    ENFORMATIKA, VOL 7: IEC 2005 PROCEEDINGS, 2005, : 56 - 61
  • [24] Genetic Programming Based Data Projections for Classification Tasks
    Estebanez, Cesar
    Aler, Ricardo
    Valls, Jose M.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 7, 2005, 7 : 56 - 61
  • [25] A Genetic-Based Ensemble Learning Applied to Imbalanced Data Classification
    Klikowski, Jakub
    Ksieniewicz, Pawel
    Wozniak, Michal
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING (IDEAL 2019), PT II, 2019, 11872 : 340 - 352
  • [26] Scale Genetic Programming for large Data Sets: Case of Higgs Bosons Classification
    Hmida, Hmida
    Ben Hamida, Sana
    Borgi, Amel
    Rukoz, Marta
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 302 - 311
  • [27] A constrained-syntax genetic programming system for discovering classification rules: application to medical data sets
    Bojarczuk, CC
    Lopes, HS
    Freitas, AA
    Michalkiewicz, EL
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2004, 30 (01) : 27 - 48
  • [28] HANDLING HIGHLY-DIMENSIONAL CLASSIFICATION TASKS WITH HIERARCHICAL GENETIC FUZZY RULE-BASED CLASSIFIERS
    Stavrakoudis, Dimitris G.
    Theocharis, John B.
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2012, 20 : 73 - 104
  • [29] Hierarchical-interpolative Fuzzy System Construction by Genetic and Bacterial Programming Algorithms
    Balazs, Krisztian
    Koczy, Laszlo T.
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 2116 - 2122
  • [30] Genetic Network Programming for Fuzzy Association Rule-Based Classification
    Taboada, Karla
    Mabu, Shingo
    Gonzales, Eloy
    Shimada, Kaoru
    Hirasawa, Kotaro
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 2387 - 2394