Clustering Qualitative Data Based on Binary Equivalence Relations: Neighborhood Search Heuristics for the Clique Partitioning Problem

被引:31
作者
Brusco, Michael J. [1 ]
Koehn, Hans-Friedrich [2 ]
机构
[1] Florida State Univ, Dept Mkt, Coll Business, Tallahassee, FL 32306 USA
[2] Univ Missouri, Columbia, MO 65211 USA
关键词
equivalence relation; clique partitioning; clustering; heuristics; tabu search; simulated annealing; neighborhood search; MINIMUM SUM; CLASSIFICATION; ALGORITHM; FACETS;
D O I
10.1007/s11336-009-9126-z
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The clique partitioning problem (CPP) requires the establishment of an equivalence relation for the vertices of a graph such that the sum of the edge costs associated with the relation is minimized. The CPP has important applications for the social sciences because it provides a framework for clustering objects measured on a collection of nominal or ordinal attributes. In such instances, the CPP incorporates edge costs obtained from an aggregation of binary equivalence relations among the attributes. We review existing theory and methods for the CPP and propose two versions of a new neighborhood search algorithm for efficient solution. The first version (NS-R) uses a relocation algorithm in the search for improved solutions, whereas the second (NS-TS) uses an embedded tabu search routine. The new algorithms are compared to simulated annealing (SA) and tabu search (TS) algorithms from the CPP literature. Although the heuristics yielded comparable results for some test problems, the neighborhood search algorithms generally yielded the best performances for large and difficult instances of the CPP.
引用
收藏
页码:685 / 703
页数:19
相关论文
共 59 条
[1]  
Arabie P., 1996, Clustering and Classification, P5, DOI 10.1142/9789812832153_0002
[2]  
Barthelemy J. P., 1981, Mathematical Social Sciences, V1, P235, DOI 10.1016/0165-4896(81)90041-X
[3]  
Barthelemy J. P., 1988, Classification and Related Methods of Data Analysis. Proceedings of the First Conference of the International Federation of Classification Societies (IFCS), P309
[4]  
BARTLETT B, 1995, AUST J PUBLIC HEALTH, V19, P3
[5]  
Borda J. D, 1784, MEMOIRE ELECTIONS SC, V102, P657
[6]   A variable neighborhood search method for generalized blockmodeling of two-mode binary matrices [J].
Brusco, Michael ;
Steinley, Douglas .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2007, 51 (05) :325-338
[7]   Optimal partitioning of a data set based on the p-median model [J].
Brusco, Michael J. ;
Koehn, Hans-Friedrich .
PSYCHOMETRIKA, 2008, 73 (01) :89-105
[8]   A comparison of heuristic procedures for minimum within-cluster sums of squares partitioning [J].
Brusco, Michael J. ;
Steinley, Douglas .
PSYCHOMETRIKA, 2007, 72 (04) :583-600
[9]   Comment on "clustering by passing messages between data points" [J].
Brusco, Michael J. ;
Koehn, Hans-Friedrich .
SCIENCE, 2008, 319 (5864)
[10]   IMPROVING PERSONNEL SCHEDULING AT AIRLINE STATIONS [J].
BRUSCO, MJ ;
JACOBS, LW ;
BONGIORNO, RJ ;
LYONS, DV ;
TANG, BX .
OPERATIONS RESEARCH, 1995, 43 (05) :741-751