Nature-inspired framework of ensemble learning for collaborative classification in granular computing context

被引:0
作者
Han Liu
Mihaela Cocea
机构
[1] Cardiff University,School of Computer Science and Informatics
[2] University of Portsmouth,School of Computing
来源
Granular Computing | 2019年 / 4卷
关键词
Machine learning; Ensemble learning; Classification; Bagging; Random forests; Granular computing;
D O I
暂无
中图分类号
学科分类号
摘要
Due to the vast and rapid increase in the size of data, machine learning has become an increasingly popular approach of data classification, which can be done by training a single classifier or a group of classifiers. A single classifier is typically learned by using a standard algorithm, such as C4.5. Due to the fact that each of the standard learning algorithms has its own advantages and disadvantages, ensemble learning, such as Bagging, has been increasingly used to learn a group of classifiers for collaborative classification, thus compensating for the disadvantages of individual classifiers. In particular, a group of base classifiers need to be learned in the training stage, and then some or all of the base classifiers are employed for classifying unseen instances in the testing stage. In this paper, we address two critical points that can impact the classification accuracy, in order to overcome the limitations of the Bagging approach. Firstly, it is important to judge effectively which base classifiers qualify to get employed for classifying test instances. Secondly, the final classification needs to be done by combining the outputs of the base classifiers, i.e. voting, which indicates that the strategy of voting can impact greatly on whether a test instance is classified correctly. In order to address the above points, we propose a nature-inspired approach of ensemble learning to improve the overall accuracy in the setting of granular computing. The proposed approach is validated through experimental studies by using real-life data sets. The results show that the proposed approach overcomes effectively the limitations of the Bagging approach.
引用
收藏
页码:715 / 724
页数:9
相关论文
共 66 条
[1]  
Borra S(2002)Improving nonparametric regression methods by bagging and boosting Comput Stat Data Anal 38 407-420
[2]  
Ciaccio AD(1996)Bagging predictors Mach Learn 24 123-140
[3]  
Breiman L(2001)Random forests Mach Learn 45 5-32
[4]  
Breiman L(2017)Unified granular-number-based ahp-vikor multi-criteria decision framework Granul Comput 2 199-221
[5]  
Chatterjee K(2002)Distributed learning with bagging-like performance Pattern Recogn Lett 24 455-471
[6]  
Kar S(2011)Weighted fuzzy rule interpolation based on ga-based weight-learning techniques IEEE Trans Fuzzy Syst 19 729-744
[7]  
Chawla NV(2011)Handling forecasting problems based on high-order fuzzy logical relationships Expert Syst Appl 38 3857-3864
[8]  
Moore TE(2006)Forecasting enrollments using high-order fuzzy time series and genetic algorithms Int J Inf Manage Sci 17 1-17
[9]  
Hall LO(2011)Fuzzy forecasting based on high-order fuzzy logical relationships and automatic clustering techniques Expert Syst Appl 38 15,425-15,437
[10]  
Bowyer KW(2009)Forecasting enrollments using automatic clustering techniques and fuzzy logical relationships Expert Syst Appl 36 11,070-11,076