Learning from examples with data reduction and stacked generalization

被引:6
作者
Czarnowski, Ireneusz [1 ]
Jedrzejowicz, Piotr [1 ]
机构
[1] Gdynia Maritime Univ, Dept Informat Syst, Morska 83, PL-81225 Gdynia, Poland
关键词
Learning from big data; data reduction; stacked generalization; kernel-based clustering; NEAREST-NEIGHBOR RULE; INSTANCE SELECTION; ALGORITHM; RANKING; KERNEL;
D O I
10.3233/JIFS-169137
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data reduction can increase generalization abilities of the learning model and shorten learning time. It can be particularly helpful in analyzing big data sets. This paper focuses on the machine learning from examples with data reduction. In the paper data reduction is carried out by selection of relevant instances, called prototypes. The discussed approach bases on the assumption that the selection of prototypes is carried-out by a team of agents and that the prototype instances are selected from clusters of instances under the constraint that from each cluster a single prototype is obtained. For cluster initialization the kernel-based fuzzy clustering algorithm is used. Main feature of the proposed approach is integrating data reduction with the stacking technique. Stacked generalization assures diversification among prototypes, and hence, base classifiers. To validate the proposed approach we have carried-out computational experiment. We have also evaluated experimentally the influence of the clustering method and the number of stacking folds used, on the classification accuracy.
引用
收藏
页码:1401 / 1411
页数:11
相关论文
共 47 条
[1]   INSTANCE-BASED LEARNING ALGORITHMS [J].
AHA, DW ;
KIBLER, D ;
ALBERT, MK .
MACHINE LEARNING, 1991, 6 (01) :37-66
[2]  
ANDREWS NO, 2007, TR0736 VIRG TECH
[3]  
[Anonymous], 2003, KNOWL INF SYST
[4]  
[Anonymous], 1996, 185996 EDRC CARN MEL
[5]  
Asuncion A., 2007, Uci machine learning repository
[6]   Adaptive integrated image segmentation and object recognition [J].
Bhanu, B ;
Peng, J .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2000, 30 (04) :427-441
[7]  
Bull L., 2004, STUDIES FUZZINESS SO
[8]   On the combination of evolutionary algorithms and stratified strategies for training set selection in data mining [J].
Cano, JR ;
Herrera, F ;
Lozano, M .
APPLIED SOFT COMPUTING, 2006, 6 (03) :323-332
[9]   A density-based approach for instance selection [J].
Carbonera, Joel Luis ;
Abel, Mara .
2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, :768-774
[10]  
Czarnowski I, 2004, BCS CONFERENCE S, P267