A sample-based hierarchical adaptive K-means clustering method for large-scale video retrieval

被引:31
作者
Liao, Kaiyang [1 ]
Liu, Guizhong [1 ]
Xiao, Li [1 ]
Liu, Chaoteng [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Data clustering; K-means algorithm; Pattern recognition; Content-based video copy detection; Large-scale data; ALGORITHMS;
D O I
10.1016/j.knosys.2013.05.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding useful patterns in large datasets has attracted considerable interest recently, and one of the most widely studied problems in this area is the identification of clusters in a multi-dimensional dataset. This paper introduces a sample-based hierarchical adaptive K-means (SHAKM) clustering algorithm for large-scale video retrieval. To handle large databases efficiently, SHAKM employs a multilevel random sampling strategy. Furthermore, SHAKM utilises the adaptive K-means clustering algorithm to determine the correct number of clusters and to construct an unbalanced cluster tree. Furthermore, SHAKM uses the fast labelling scheme to assign each pattern in the dataset to the closest cluster. To evaluate the proposed method, several datasets are used to illustrate its effectiveness. The results show that SHAKM is fast and effective on very large datasets. Furthermore, the results demonstrate that the proposed method can be used efficiently and successfully for a project on content-based video copy detection. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:123 / 133
页数:11
相关论文
共 39 条
[1]  
[Anonymous], 2006, P 12 ACM SIGKDD INT, DOI [10.1145/1150402.1150467, DOI 10.1145/1150402.1150467]
[2]  
[Anonymous], MODERN INFORM RETRIE
[3]  
[Anonymous], 1996, SIGMOD REC ACM SPEC, DOI DOI 10.1145/235968.233324
[4]  
[Anonymous], 2006, 2006 IEEE COMP SOC C
[5]  
[Anonymous], 02017 U MIN DEP COMP
[6]  
[Anonymous], CLUSTERING CLASSIFIC
[7]  
[Anonymous], 1993, Handbook of pattern recognition and computer vision, DOI DOI 10.1142/9789814343138_0001
[8]   Fast modified global k-means algorithm for incremental cluster construction [J].
Bagirov, Adil M. ;
Ugon, Julien ;
Webb, Dean .
PATTERN RECOGNITION, 2011, 44 (04) :866-876
[9]  
Ball H.G., 1964, P INT C MICR CIRC TH, V6, P281
[10]   Approximate clustering in very large relational data [J].
Bezdek, James C. ;
Hathaway, Richard J. ;
Huband, Jacalyn M. ;
Leckie, Christopher ;
Kotagiri, Ramamohanarao .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2006, 21 (08) :817-841