GEP-based classifier for mining imbalanced data

被引:12
|
作者
Jedrzejowicz, Joanna [1 ]
Jedrzejowicz, Piotr [2 ]
机构
[1] Univ Gdansk, Inst Informat, Fac Math Phys & Informat, PL-80308 Gdansk, Poland
[2] Gdynia Maritime Univ, Dept Informat Syst, Morska 83, PL-81225 Gdynia, Poland
关键词
Imbalanced classification; Incremental learning; Gene expression programming; DATA STREAMS; RULES;
D O I
10.1016/j.eswa.2020.114058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper proposes an incremental Gene Expression Programming classifier for mining imbalanced datasets. Imbalanced datasets are commonly encountered in real-life applications. There exist numerous algorithms, techniques, and tools which are proposed as suitable for dealing with imbalanced class distribution. Yet, none of them seems to be able to outperform all others in all possible applications. We believe that our approach can extend the available range of learners that have proven good performance in mining imbalanced data and imbalanced streams. The idea is to adapt the GEP classifier to requirements of the imbalanced data environment with reuse of the minority class instances, and application of the incremental learning paradigm. The paper offers an overview of the related work and a detailed description of the proposed incremental learner. An extensive computational experiment, based on data from the KEEL dataset repository, proves that in numerous cases the approach is competitive to other state-of-the-art learners.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] FIR as Classifier in the Presence of Imbalanced Data
    Bagherpour, Solmaz
    Nebot, Angela
    Mugica, Francisco
    ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 490 - 496
  • [22] A fuzzy classifier for imbalanced and noisy data
    Visa, S
    Ralescu, A
    2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 1727 - 1732
  • [23] A Classifier Hub for Imbalanced Financial Data
    Abeysinghe, Chirath
    Li, Jianguo
    He, Jing
    DATABASES THEORY AND APPLICATIONS, (ADC 2016), 2016, 9877 : 476 - 479
  • [24] Hybrid Classifier Ensemble for Imbalanced Data
    Yang, Kaixiang
    Yu, Zhiwen
    Wen, Xin
    Cao, Wenming
    Chen, C. L. Philip
    Wong, Hau-San
    You, Jane
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (04) : 1387 - 1400
  • [25] An improved fuzzy classifier for imbalanced data
    Yan, Dandan
    Yang, Youlong
    Li, Benchong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 32 (03) : 2315 - 2325
  • [26] A GEP-based spatial decision support system for multisite land use allocation
    Eldrandaly, Khalid
    APPLIED SOFT COMPUTING, 2010, 10 (03) : 694 - 702
  • [27] An Agent Based Rough Classifier for Data Mining
    Abu Bakar, Azuraliza
    Othman, Zulaiha Ali
    Hamdan, Abdul Razak
    Yusof, Rozianiwati
    Ismail, Ruhaizan
    ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, PROCEEDINGS, 2008, : 145 - 151
  • [28] Data Mining on Imbalanced Data Sets
    Gu, Qiong
    Cai, Zhihua
    Zhu, Li
    Huang, Bo
    2008 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, 2008, : 1020 - 1024
  • [29] A New Classifier for Imbalanced Data Based on a Generalized Density Ratio Model
    Li, Junjun
    Cui, Wenquan
    COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2023, 11 (02) : 369 - 401
  • [30] Information granulation based data mining approach for classifying imbalanced data
    Chen, Mu-Chen
    Chen, Long-Sheng
    Hsu, Chun-Chin
    Zeng, Wei-Rong
    INFORMATION SCIENCES, 2008, 178 (16) : 3214 - 3227