Clustering Web video search results based on integration of multiple features

被引:0
作者
Alex Hindle
Jie Shao
Dan Lin
Jiaheng Lu
Rui Zhang
机构
[1] The University of Melbourne,Department of Computer Science and Software Engineering
[2] Missouri University of Science and Technology,Department of Computer Science
[3] Renmin University of China,School of Information and DEKE, MOE
来源
World Wide Web | 2011年 / 14卷
关键词
Web video; YouTube; search results clustering; user interface;
D O I
暂无
中图分类号
学科分类号
摘要
The usage of Web video search engines has been growing at an explosive rate. Due to the ambiguity of query terms and duplicate results, a good clustering of video search results is essential to enhance user experience as well as improve retrieval performance. Existing systems that cluster videos only consider the video content itself. This paper presents the first system that clusters Web video search results by fusing the evidences from a variety of information sources besides the video content such as title, tags and description. We propose a novel framework that can integrate multiple features and enable us to adopt existing clustering algorithms. We discuss our careful design of different components of the system and a number of implementation decisions to achieve high effectiveness and efficiency. A thorough user study shows that with an innovative interface showing the clustering output, our system delivers a much better presentation of search results and hence increases the usability of video search engines significantly.
引用
收藏
页码:53 / 73
页数:20
相关论文
共 69 条
[1]  
Banerjee A(2005)Clustering with Bregman divergences J. Mach. Learn. Res. 6 1705-1749
[2]  
Merugu S(2009)Social propagation: boosting social annotations for web mining World Wide Web 12 399-420
[3]  
Dhillon IS(2003)Efficient video similarity measurement with video signature IEEE Trans. Circuits Syst. Video Technol. 13 59-74
[4]  
Ghosh J(2009)The effectiveness of latent semantic analysis for building up a bottom-up taxonomy from folksonomy tags World Wide Web 12 421-440
[5]  
Bao S(2007)Clustering by passing messages between data points Science 315 972-976
[6]  
Yang B(2010)Building a usable and accessible semantic Web interaction platform World Wide Web 13 143-167
[7]  
Fei B(2000)Real life, real users, and real needs: a study and analysis of user queries on the Web Inf. Process. Manag. 36 207-227
[8]  
Xu S(2006)Sentence similarity based on semantic nets and corpus statistics IEEE Trans. Knowl. Data Eng. 18 1138-1150
[9]  
Su Z(2007)A new algorithm for clustering search results Data Knowl. Eng. 62 504-522
[10]  
Yu Y(2005)A concept-driven algorithm for clustering search results IEEE Intell. Syst. 20 48-54