Unsupervised Video Shot Detection Using Clustering Ensemble with a Color Global Scale-Invariant Feature Transform Descriptor

被引:15
作者
Chang, Yuchou [1 ]
Lee, D. J. [1 ]
Hong, Yi [2 ]
Archibald, James [1 ]
机构
[1] Brigham Young Univ, Dept Elect & Comp Engn, Provo, UT 84602 USA
[2] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
关键词
D O I
10.1155/2008/860743
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Scale-invariant feature transform (SIFT) transforms a grayscale image into scale-invariant coordinates of local features that are invariant to image scale, rotation, and changing viewpoints. Because of its scale-invariant properties, SIFT has been successfully used for object recognition and content-based image retrieval. The biggest drawback of SIFT is that it uses only grayscale information and misses important visual information regarding color. In this paper, we present the development of a novel color feature extraction algorithm that addresses this problem, and we also propose a new clustering strategy using clustering ensembles for video shot detection. Based on Fibonacci lattice-quantization, we develop a novel color global scale-invariant feature transform (CGSIFT) for better description of color contents in video frames for video shot detection. CGSIFT first quantizes a color image, representing it with a small number of color indices, and then uses SIFT to extract features from the quantized color index image. We also develop a new space description method using small image regions to represent global color features as the second step of CGSIFT. Clustering ensembles focusing on knowledge reuse are then applied to obtain better clustering results than using single clustering methods for video shot detection. Evaluation of the proposed feature extraction algorithm and the new clustering strategy using clustering ensembles reveals very promising results for video shot detection. Copyright (C) 2008 Yuchou Chang et al.
引用
收藏
页数:10
相关论文
共 35 条
[1]  
[Anonymous], 2003, VIDEO CONTENT ANAL U
[2]  
Berkhin P., 2002, SURVEY CLUSTERING MI
[3]   Video shot detection and condensed representation [J].
Cotsaces, C ;
Nikolaidis, N ;
Pitas, I .
IEEE SIGNAL PROCESSING MAGAZINE, 2006, 23 (02) :28-37
[4]  
DESELAERS T, 2003, THESIS AACHEN U AACH
[5]   Combining multiple clusterings using evidence accumulation [J].
Fred, ALN ;
Jain, AK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (06) :835-850
[6]   Video hierarchical structure mining [J].
Fu, Chang-Jian ;
Li, Guo-Hui ;
Wu, Jun-Tao ;
Fu, Chang-Jian .
2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, :2150-+
[7]  
Gervautz M., 1988, NEW TRENDS COMPUTER, P219, DOI [DOI 10.1007/978-3-642-83492-9_20, 10.1007/978-3-642-83492-9_20]
[8]  
Grama Ananth, 2003, Introduction to Parallel Computing
[9]  
Heckbert P., 1982, Computer Graphics, V16, P297, DOI 10.1145/965145.801294
[10]  
JAIN AK, 1989, INFORM SYSTEM SCI SE