A clustering-based discretization for supervised learning

Cited by: 39
Authors
Gupta, Ankit [2]
Mehrotra, Kishan G. [1]
Mohan, Chilukuri [1]
Affiliations
[1] Syracuse Univ, Dept Elect Engn & Comp Sci, Ctr Sci & Technol 4 106, Syracuse, NY 13244 USA
[2] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India
Keywords
Discretization; Clustering; Binning; Supervised learning
DOI
10.1016/j.spl.2010.01.015
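Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract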
We address the problem of discretization of continuous variables for machine learning classification algorithms. Existing procedures do not use interdependence between the variables towards this goal. Our proposed method uses clustering to exploit such interdependence. Numerical results show that this improves the classification performance in almost all cases. Even if an existing algorithm can successfully operate with continuous variables, better performance is obtained if the variables are first discretized. An additional advantage of discretization is that it reduces the overall computation time. (C) 2010 Elsevier B.V. All rights reserved.
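The abstract does not spell out the algorithm, so the following is only a rough illustration of the general idea and not the authors' procedure. It assumes one possible reading: cluster the data over all variables jointly (here with a plain Lloyd's k-means), then place each variable's cut points midway between adjacent cluster centers along that variable. The helper names `kmeans`, `cluster_cut_points`, and `discretize`, the choice of k-means, and the toy data are all assumptions made for this sketch.

```python
import numpy as np


def kmeans(X, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns centroids of shape (k, n_features)."""
    rng = np.random.default_rng(seed)
    # Start from k distinct data points chosen at random.
    centroids = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    for _ in range(n_iter):
        # Assign every point to its nearest centroid using ALL variables,
        # so the grouping reflects interdependence between them.
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        new = centroids.copy()
        for j in range(k):
            members = X[labels == j]
            if len(members):  # keep the old centroid if a cluster is empty
                new[j] = members.mean(axis=0)
        if np.allclose(new, centroids):
            break
        centroids = new
    return centroids


def cluster_cut_points(X, k):
    """Hypothetical helper: per-variable cut points from joint clusters."""
    centroids = kmeans(X, k)
    cuts = []
    for col in range(X.shape[1]):
        centers = np.unique(centroids[:, col])  # sorted, duplicates removed
        # Midpoints between adjacent cluster centers along this variable.
        cuts.append((centers[:-1] + centers[1:]) / 2.0)
    return cuts


def discretize(X, cuts):
    """Map each continuous column to integer bin indices."""
    columns = [np.digitize(X[:, c], cuts[c]) for c in range(X.shape[1])]
    return np.column_stack(columns)


# Toy data (made up for illustration): two well-separated groups.
X = np.array([
    [0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
    [10.0, 10.0], [10.1, 10.2], [10.2, 10.1],
])
cuts = cluster_cut_points(X, k=2)
codes = discretize(X, cuts)
```

The resulting integer codes can be passed to any classifier that expects categorical inputs. A per-variable method such as equal-width binning would pick its cut points without looking at the other columns, which is the contrast the abstract draws.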
Pages: 816-824
Number of pages: 9