ON USING STRUCTURAL PATTERNS IN DATA FOR CLASSIFICATION

被引:2
作者
Arslan, G. [1 ]
Karabulut, B. [2 ]
Unver, H. M. [2 ]
机构
[1] Kirikkale Univ, Dept Stat, Kirikkale, Turkey
[2] Kirikkale Univ, Dept Comp Engn, Kirikkale, Turkey
关键词
structural pattern; clustering; classification; k-means; support vector machine;
D O I
10.17654/AS065010033
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
There are some interesting approaches for classification such as semi-supervised algorithms, algorithms that learn distance functions, and various extensions and generalizations of support vector machines. In this study, we propose a new clustering algorithm that uses similarities only and is used as an intermediate step for classification. The motivation for this combined approach is to obtain information from the data set that can be used for classification. After obtaining a clustering of the data set with the proposed clustering algorithm, we apply different strategies for classification. The results on some data sets show that this approach can have some advantages. For example, when using support vector machines, the size of the training set is reduced, while at the same time, comparable performance results are obtained with a smaller number of support vectors.
引用
收藏
页码:33 / 56
页数:24
相关论文
共 21 条
[1]   Instance selection of linear complexity for big data [J].
Arnaiz-Gonzalez, Alvar ;
Diez-Pastor, Jose-Francisco ;
Rodriguez, Juan J. ;
Garcia-Osorio, Cesar .
KNOWLEDGE-BASED SYSTEMS, 2016, 107 :83-95
[2]  
Arslan Guvenc, 2011, 2 INT FUZZ SYST S FU, P13
[3]  
Banerjee A, 2005, J MACH LEARN RES, V6, P1705
[4]   A nonparametric classification method based on K-associated graphs [J].
Bertini, Joao Roberto, Jr. ;
Zhao, Liang ;
Motta, Robson ;
Lopes, Alneu de Andrade .
INFORMATION SCIENCES, 2011, 181 (24) :5435-5456
[5]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[6]  
Chen YD, 2014, J MACH LEARN RES, V15, P2213
[7]   Efficient Algorithm for Localized Support Vector Machine [J].
Cheng, Haibin ;
Tan, Pang-Ning ;
Jin, Rong .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (04) :537-549
[8]   Rough sets theory for multicriteria decision analysis [J].
Greco, S ;
Matarazzo, B ;
Slowinski, R .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2001, 129 (01) :1-47
[9]  
Hertz Tomer., 2004, International Conference on Machine Learning (ICML), P393
[10]   Twin support vector machines for pattern classification [J].
Jayadeva ;
Khemchandani, R. ;
Chandra, Suresh .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (05) :905-910