Multiple strong and balanced cluster-based ensemble of deep learners

被引:12
作者
Jan, Zohaib [1 ]
Verma, Brijesh [1 ]
机构
[1] Cent Queensland Univ, Ctr Intelligent Syst, Brisbane, Qld 4000, Australia
基金
澳大利亚研究理事会;
关键词
Deep learning; Ensemble classifier; Neural networks; Clustering; CLASSIFIER ENSEMBLES; RANDOM FORESTS; REGRESSION;
D O I
10.1016/j.patcog.2020.107420
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (CNNs), also known as deep learners have seen much success in the last few years due to the availability of large amounts of data and high-performance computational resources. A CNN can be trained effectively if large amounts of data are available as it enables a CNN to find the optimal set of features and weights that can achieve the highest generalization performance. However, due to the requirement of large data size, CNNs require a lot of resources for example running time and computational resources to achieve a reasonable performance. Additionally, unbalanced data makes it difficult to train a CNN effectively that can achieve good generalization performance. In order to alleviate these limitations, in this paper, we propose a novel ensemble of deep learners that learns by combining multiple deep learners trained on small strongly class associated input data effectively. We propose a novel methodology of generating random subspace through clustering input data and propose a measure which can classify each cluster as a strong data cluster and a balanced data cluster. A methodology is also proposed that balances all strong data clusters in the pool so that an architecturally simple CNN can be trained on all balanced data clusters simultaneously. Classification decisions on all trained CNNs are then fused through majority voting to generate class decisions of the ensemble. The performance of the proposed ensemble approach is evaluated on UCI benchmark datasets, and results are compared with existing state-of-the-art ensemble approaches. Significance testing was conducted to further validate the efficacy of the results and a significance test analysis is presented. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] GUIDED INPAINTING WITH CLUSTER-BASED AUXILIARY INFORMATION
    Maugey, Thomas
    Frossard, Pascal
    Guillemot, Christine
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1702 - 1706
  • [42] SPOKEN LANGUAGE RECOGNITION WITH CLUSTER-BASED MODELING
    Kacprzak, Stanislaw
    Rybicka, Magdalena
    Kowalczyk, Konrad
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6867 - 6871
  • [43] Cluster-based color matching for image retrieval
    Kankanhalli, MS
    Mehtre, BM
    Wu, JK
    PATTERN RECOGNITION, 1996, 29 (04) : 701 - 708
  • [44] Cluster-Based Profile Analysis in Phase I
    Chen, Yajuan
    Birch, Jeffrey B.
    Woodall, William H.
    JOURNAL OF QUALITY TECHNOLOGY, 2015, 47 (01) : 14 - 29
  • [45] Cluster-Based Prediction for Batteries in Data Centers
    Haider, Syed Naeem
    Zhao, Qianchuan
    Li, Xueliang
    ENERGIES, 2020, 13 (05)
  • [46] Cluster-Based Similarity Search in Time Series
    Karamitopoulos, Leonidas
    Evangelidis, Georgios
    PROCEEDINGS OF THE 2009 FOURTH BALKAN CONFERENCE IN INFORMATICS, 2009, : 113 - 118
  • [47] Nearest cluster-based intrusion detection through convolutional neural networks
    Andresini, Giuseppina
    Appice, Annalisa
    Malerba, Donato
    KNOWLEDGE-BASED SYSTEMS, 2021, 216
  • [48] Cluster-Based Secure Aggregation for Federated Learning
    Kim, Jien
    Park, Gunryeong
    Kim, Miseung
    Park, Soyoung
    ELECTRONICS, 2023, 12 (04)
  • [49] A New Cluster-based Instance Selection Algorithm
    Czarnowski, Ireneusz
    Jedrzejowicz, Piotr
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, 2011, 6682 : 436 - 445
  • [50] Cluster-based industrialization in China: Financing and performance
    Long, Cheryl
    Zhang, Xiaobo
    JOURNAL OF INTERNATIONAL ECONOMICS, 2011, 84 (01) : 112 - 123