Semantics supervised cluster-based index for video databases

被引:0
作者
Shi, Zhiping [1 ]
Li, Qingyong
Shil, Zhiwei
Shil, Zhongzhi
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
[2] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
来源
IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS | 2006年 / 4071卷
关键词
high-dimensional index; cluster; video semantics; video database;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-dimensional index is one of the most challenging tasks for content-based video retrieval (CBVR). Typically, in video database, there exist two kinds of clues for query: visual features and semantic classes. In this paper, we modeled the relationship between semantic classes and visual feature distributions of data set with the Gaussian mixture model (GMM), and proposed a semantics supervised cluster based index approach (briefly as SSCI) to integrate the advantages of both semantic classes and visual features. The entire data set is divided hierarchically by a modified clustering technique into many clusters until the objects within a cluster are not only close in the visual feature space but also within the same semantic class, and then an index entry including semantic clue and visual feature clue is built for each cluster. Especially, the visual feature vectors in a cluster are organized adjacently in disk. So the SSCI-based nearest-neighbor (NN) search can be divided into two phases: the first phase computes the distances between the query example and each cluster index and returns the clusters with the smallest distance, here namely candidate clusters; then the second phase retrieves the original feature vectors within the candidate clusters to gain the approximate nearest neighbors. Our experiments showed that for approximate searching the SSCI-based approach was faster than VA(+)-based approach; moreover, the quality of the result set was better than that of the sequential search in terms of semantics.
引用
收藏
页码:453 / 462
页数:10
相关论文
共 10 条
[1]   A knowledge-based approach to video content classification [J].
Chen, Y ;
Wong, EK .
STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2001, 2001, 4315 :292-300
[2]  
Ferhatosmanoglu H., 2000, Proceedings of the Ninth International Conference on Information and Knowledge Management. CIKM 2000, P202, DOI 10.1145/354756.354820
[3]   Approximate nearest neighbor searching in multimedia databases [J].
Ferhatosmanoglu, H ;
Tuncel, E ;
Agrawal, D ;
El Abbadi, A .
17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, :503-511
[4]  
FISCHER S, 1995, ACM MULTIMEDIA, P295
[5]   Addressing the problems of Bayesian network classification of video using high-dimensional features [J].
Mittal, A ;
Cheong, LF .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (02) :230-244
[6]  
Shi Zhi-Ping, 2005, Journal of Software, V16, P1039, DOI 10.1360/jos161039
[7]  
WEBER KVR, 1998, P ACM SIGMOD ICMD, P166
[8]  
Weber R, 2000, LECT NOTES COMPUT SC, V1777, P21
[9]  
WEBER R, 1998, P ACM VLDB
[10]  
Ye HJ, 2003, LECT NOTES COMPUT SC, V2728, P477