Learning from examples with data reduction and stacked generalization

Cited by: 6
Authors
Czarnowski, Ireneusz [1]
Jedrzejowicz, Piotr [1]
Affiliations
[1] Gdynia Maritime Univ, Dept Informat Syst, Morska 83, PL-81225 Gdynia, Poland
Keywords
Learning from big data; data reduction; stacked generalization; kernel-based clustering; NEAREST-NEIGHBOR RULE; INSTANCE SELECTION; ALGORITHM; RANKING; KERNEL;
DOI
10.3233/JIFS-169137
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data reduction can increase the generalization ability of a learning model and shorten learning time. It can be particularly helpful in analyzing big data sets. This paper focuses on machine learning from examples with data reduction. In the paper, data reduction is carried out by selecting relevant instances, called prototypes. The discussed approach is based on the assumption that the selection of prototypes is carried out by a team of agents and that the prototype instances are selected from clusters of instances under the constraint that a single prototype is obtained from each cluster. For cluster initialization, the kernel-based fuzzy clustering algorithm is used. The main feature of the proposed approach is the integration of data reduction with the stacking technique. Stacked generalization assures diversification among prototypes and, hence, among base classifiers. To validate the proposed approach, we have carried out a computational experiment. We have also experimentally evaluated the influence of the clustering method and the number of stacking folds on classification accuracy.
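The "one prototype per cluster" constraint described in the abstract can be illustrated with a minimal sketch. This is not the authors' method: plain k-means stands in for the kernel-based fuzzy clustering, a nearest-to-centroid rule stands in for the agent-based prototype selection, and the stacking step is not shown. All function names, parameters, and the synthetic data are hypothetical and chosen only for illustration.

```python
import numpy as np

def kmeans(X, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means (a stand-in, NOT kernel-based fuzzy clustering)."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Assign each instance to its nearest center.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Recompute centers; an empty cluster keeps its previous center.
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels, centers

def select_prototypes(X, y, clusters_per_class=3, seed=0):
    """Cluster each class separately and keep one instance per cluster.

    Assumption: the prototype is the instance closest to the cluster mean.
    The paper selects prototypes with a team of agents instead.
    """
    proto_idx = []
    for c in np.unique(y):
        idx = np.flatnonzero(y == c)
        k = min(clusters_per_class, len(idx))
        labels, centers = kmeans(X[idx], k, seed=seed)
        for j in range(k):
            members = idx[labels == j]
            if len(members) == 0:
                continue  # skip clusters that ended up empty
            d = np.linalg.norm(X[members] - centers[j], axis=1)
            proto_idx.append(members[d.argmin()])
    return np.array(proto_idx)

def predict_1nn(X_proto, y_proto, X_new):
    """Classify each new instance with the label of its nearest prototype."""
    d = np.linalg.norm(X_new[:, None, :] - X_proto[None, :, :], axis=2)
    return y_proto[d.argmin(axis=1)]

# Hypothetical usage on synthetic data: two well-separated Gaussian classes.
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(0.0, 1.0, (100, 2)), rng.normal(10.0, 1.0, (100, 2))])
y = np.array([0] * 100 + [1] * 100)

proto_idx = select_prototypes(X, y, clusters_per_class=3)
X_proto, y_proto = X[proto_idx], y[proto_idx]
accuracy = (predict_1nn(X_proto, y_proto, X) == y).mean()
```

In this sketch the 200 training instances are reduced to at most six prototypes, and a nearest-neighbor classifier is then built on the prototypes alone. Clustering each class separately is a design choice made here so that every prototype carries an unambiguous class label.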
Pages: 1401-1411
Number of pages: 11