Exploring Multiobjective Optimization for Multiview Clustering

Cited by: 22
Authors
Saha, Sriparna [1]
Mitra, Sayantan [1]
Kramer, Stefan [2]
Affiliations
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna 801103, Bihar, India
[2] Johannes Gutenberg Univ Mainz, Inst Comp Sci, Staudingerweg 9, D-55128 Mainz, Germany
Keywords
Multiview classification; multiobjective optimization; simulated annealing; search result clustering; PIXEL CLASSIFICATION; ALGORITHM;
DOI
10.1145/3182181
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We present a new multiview clustering approach based on multiobjective optimization. In contrast to existing clustering algorithms based on multiobjective optimization, it is generally applicable to data represented by two or more views and does not require specifying the number of clusters a priori. The approach builds upon the search capability of a multiobjective simulated annealing based technique, AMOSA, as the underlying optimization technique. In the first version of the proposed approach, an internal cluster validity index is used to assess the quality of different partitionings obtained using different views. A new way of checking the compatibility of these different partitionings is also proposed and this is used as another objective function. A new encoding strategy and some new mutation operators are introduced. Finally, a new way of computing a consensus partitioning from multiple individual partitions obtained on multiple views is proposed. As a baseline and for comparison, two multiobjective based ensemble clustering techniques are proposed to combine the outputs of different simple clustering approaches. The efficacy of the proposed clustering methods is shown for partitioning several real-world datasets having multiple views. To show the practical usefulness of the method, we present results on web-search result clustering, where the task is to find a suitable partitioning of web snippets.
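The abstract names two ingredients without giving formulas: a compatibility check between partitionings from different views, and a multiobjective search that keeps non-dominated solutions. The following is a minimal sketch of those two ideas only, not the authors' method. It assumes pairwise co-membership agreement (the Rand index) as a stand-in compatibility score and assumes all objectives are maximized; the names `pair_agreement`, `dominates`, and `pareto_front` are hypothetical, and AMOSA itself is not implemented here.

```python
from itertools import combinations


def pair_agreement(labels_a, labels_b):
    """Fraction of point pairs on which two partitionings agree (Rand index).

    Assumption: used here as an illustrative compatibility score between
    the partitionings of two views; the paper's own measure may differ.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both partitionings must label the same points")
    pairs = list(combinations(range(len(labels_a)), 2))
    if not pairs:
        return 1.0
    agree = sum(
        (labels_a[i] == labels_a[j]) == (labels_b[i] == labels_b[j])
        for i, j in pairs
    )
    return agree / len(pairs)


def dominates(f, g):
    """True if objective vector f Pareto-dominates g (all objectives maximized)."""
    return all(x >= y for x, y in zip(f, g)) and any(x > y for x, y in zip(f, g))


def pareto_front(candidates):
    """Keep only the objective vectors that no other candidate dominates."""
    return [c for c in candidates if not any(dominates(o, c) for o in candidates)]


# Two views that induce the same grouping under different label names
# are fully compatible under this score.
view_1 = [0, 0, 1, 1]
view_2 = [1, 1, 0, 0]
score = pair_agreement(view_1, view_2)

# Hypothetical (validity index, compatibility) pairs for three candidate solutions.
front = pareto_front([(0.9, 0.5), (0.7, 0.8), (0.6, 0.4)])
```

In this sketch the third candidate is dropped because the first one is better on both objectives, while the first two are kept because each wins on one objective.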
Pages: 30