A hierarchical genetic fuzzy system based on genetic programming for addressing classification with highly imbalanced and borderline data-sets

被引:69
|
作者
Lopez, Victoria [1 ]
Fernandez, Alberto [2 ]
Jose del Jesus, Maria [2 ]
Herrera, Francisco [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Res Ctr Informat & Commun Technol, CITIC,UGR, E-18071 Granada, Spain
[2] Univ Jaen, Dept Comp Sci, Jaen 23071, Spain
关键词
Fuzzy rule based classification systems; Hierarchical fuzzy partitions; Genetic rule selection; Tuning; Imbalanced data-sets; Borderline examples; SOFTWARE TOOL; ALGORITHMS; PROPOSAL; RECOGNITION; PERFORMANCE; ACCURACY; TAXONOMY; KEEL;
D O I
10.1016/j.knosys.2012.08.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lots of real world applications appear to be a matter of classification with imbalanced data-sets. This problem arises when the number of instances from one class is quite different to the number of instances from the other class. Traditionally, classification algorithms are unable to correctly deal with this issue as they are biased towards the majority class. Therefore, algorithms tend to misclassify the minority class which usually is the most interesting one for the application that is being sorted out. Among the available learning approaches, fuzzy rule-based classification systems have obtained a good behavior in the scenario of imbalanced data-sets. In this work, we focus on some modifications to further improve the performance of these systems considering the usage of information granulation. Specifically, a positive synergy between data sampling methods and algorithmic modifications is proposed, creating a genetic programming approach that uses linguistic variables in a hierarchical way. These linguistic variables are adapted to the context of the problem with a genetic process that combines rule selection with the adjustment of the lateral position of the labels based on the 2-tuples linguistic model. An experimental study is carried out over highly imbalanced and borderline imbalanced data-sets which is completed by a statistical comparative analysis. The results obtained show that the proposed model outperforms several fuzzy rule based classification systems, including a hierarchical approach and presents a better behavior than the C4.5 decision tree. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:85 / 104
页数:20
相关论文
共 50 条
  • [41] Feature Selection for Multi-Class Imbalanced Data Sets Based on Genetic Algorithm
    Du L.-M.
    Xu Y.
    Zhu H.
    Ann. Data Sci., 3 (293-300): : 293 - 300
  • [42] INTELLIGENT SYSTEM BASED ON GENETIC PROGRAMMING FOR ATRIAL FIBRILLATION CLASSIFICATION
    Valenzuela, Olga
    Rojas, Ignacio
    Rojas, Francisco Javier
    Pomares, Hector
    Luis Bernier, Jose
    Herrera, Javier
    Guillen, Alberto
    APPLIED ARTIFICIAL INTELLIGENCE, 2009, 23 (10) : 895 - 909
  • [43] Genetic and Bacterial Memetic Programming Approaches in Hierarchical-Interpolative Fuzzy System Construction
    Balazs, Krisztian
    Koczy, Laszlo T.
    2012 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2012,
  • [44] HIERARCHICAL-INTERPOLATIVE FUZZY SYSTEM CONSTRUCTION BY GENETIC AND BACTERIAL MEMETIC PROGRAMMING APPROACHES
    Balazs, Krisztian
    Koczy, Laszlo T.
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2012, 20 : 105 - 131
  • [45] Binary text classification using genetic programming with crossover-based oversampling for imbalanced datasets
    Aljero, Mona Khalifa A.
    Dimililer, Nazife
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2023, 31 (01) : 180 - 192
  • [46] An analysis of the rule weights and fuzzy reasoning methods for linguistic rule based classification systems applied to problems with highly imbalanced data sets
    Fernandez, Alberto
    Garcia, Salvador
    Herrera, Francisco
    Del Jesus, Maria Jose
    APPLICATIONS OF FUZZY SETS THEORY, 2007, 4578 : 170 - +
  • [47] Analysing the Hierarchical Fuzzy Rule Based Classification Systems with Genetic Rule Selection
    Fernandez, A.
    del Jesus, M. J.
    Herrera, F.
    2010 FOURTH INTERNATIONAL WORKSHOP ON GENETIC AND EVOLUTIONARY FUZZY SYSTEMS (GEFS 2010), 2010, : 69 - 74
  • [48] A Genetic Programming-Based Imputation Method for Classification with Missing Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    GENETIC PROGRAMMING, EUROGP 2016, 2016, 9594 : 149 - 163
  • [49] BGFS: Design and Development of Brain Genetic Fuzzy System for Data Classification
    Ravi, Chandrasekar
    Khare, Neelu
    JOURNAL OF INTELLIGENT SYSTEMS, 2018, 27 (02) : 231 - 247
  • [50] Classification of healthcare data using genetic fuzzy logic system and wavelets
    Thanh Nguyen
    Khosravi, Abbas
    Creighton, Douglas
    Nahavandi, Saeid
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (04) : 2184 - 2197