Supervised box clustering

被引:0
作者
Vincenzo Spinelli
机构
[1] Istat-Istituto Nazionale di Statistica,
来源
Advances in Data Analysis and Classification | 2017年 / 11卷
关键词
Supervised clustering; Classification problems; Incompatibility graphs; Homogeneous boxes; 90C27; 90C59; 68Q25;
D O I
暂无
中图分类号
学科分类号
摘要
In this work we address a technique for effectively clustering points in specific convex sets, called homogeneous boxes, having sides aligned with the coordinate axes (isothetic condition). The proposed clustering approach is based on homogeneity conditions, not according to some distance measure, and, even if it was originally developed in the context of the logical analysis of data, it is now placed inside the framework of Supervised clustering. First, we introduce the basic concepts in box geometry; then, we consider a generalized clustering algorithm based on a class of graphs, called incompatibility graphs. For supervised classification problems, we consider classifiers based on box sets, and compare the overall performances to the accuracy levels of competing methods for a wide range of real data sets. The results show that the proposed method performs comparably with other supervised learning methods in terms of accuracy.
引用
收藏
页码:179 / 204
页数:25
相关论文
共 49 条
[1]  
Bárány I(1987)Covering with Euclidean boxes Eur J Comb 8 113-119
[2]  
Lehel J(2011)The maximum box problem for moving points in the plane J Comb Optim 22 517-530
[3]  
Bereg S(2008)Logic classification and feature selection for biomedical data Comput Math Appl 55 889-899
[4]  
Díaz-Bánez JM(1997)Logical analysis of numerical data Math Program 79 163-190
[5]  
Pérez-Lantero P(2004)Exact and approximate discrete optimization algorithms for finding useful disjunctions of categorical predicates in data analysis Discrete Appl Math 144 43-58
[6]  
Ventura I(1911)Über den Variabilitätsbereich der Fourier’schen Konstanten von positiven harmonischen Funktionen Rendiconti del Circolo Matematico di Palermo 32 193-217
[7]  
Bertolazzi P(1996)Computing the maximum bichromatic discrepancy, with applications to computer graphics and machine learning J Comput Syst Sci 52 453-470
[8]  
Felici G(2002)The maximum box problem and its application to data analysis Comput Optim Appl 23 285-298
[9]  
Festa P(1983)Hypergraph families with bounded edge cover or transversal number Combinatorica 3 351-358
[10]  
Lancia G(1992)On movable separability and isotheticity Inf Sci 62 87-102