Multimodal news story clustering with pairwise visual near-duplicate constraint

被引:50
作者
Wu, Xiao [1 ,2 ]
Ngo, Chong-Wah [2 ]
Hauptmann, Alexander G. [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
关键词
multimedia topic detection and tracking; near-duplicate visual constraint; news story clustering; video data mining;
D O I
10.1109/TMM.2007.911778
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Story clustering is a critical step for news retrieval, topic mining, and summarization. Nonetheless, the task remains highly challenging owing to the fact that news topics exhibit clusters of varying densities, shapes, and sizes. Traditional algorithms are found to be ineffective in mining these types of clusters. This paper offers a new perspective by exploring the pairwise visual cues deriving from near-duplicate keyframes (NDK) for constraint-based clustering. We propose a constraint-driven co-clustering algorithm (CCC), which utilizes the near-duplicate constraints built on top of text, to mine topic-related stories and the outliers. With CCC, the duality between stories and their underlying multimodal features is exploited to transform features in low-dimensional space with normalized cut. The visual constraints are added directly to this new space, while the traditional DBSCAN is revisited to capitalize on the availability of constraints and the reduced dimensional space. We modify DBSCAN with two new characteristics for story clustering: 1) constraint-based centroid selection and 2) adaptive radius. Experiments on TRECVID-2004 corpus demonstrate that CCC with visual constraints is more capable of mining news topics of varying densities, shapes and sizes, compared with traditional k-means, DBSCAN, and spectral co-clustering algorithms.
引用
收藏
页码:188 / 199
页数:12
相关论文
共 39 条
[1]  
AGGARWAL CC, 2003, P C VER LARG DAT BAS
[2]  
Allan J, 2002, TOPIC DETECTION TRAC
[3]  
Ankerst M, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P49
[4]  
[Anonymous], ACM MULTIMEDIA
[5]  
[Anonymous], P 14 ACM INT C MULT
[6]  
[Anonymous], 1996, P 2 INT C KNOWL DISC
[7]  
[Anonymous], 2001, ICML, DOI DOI 10.1109/TPAMI.2002.1017616
[8]  
[Anonymous], 2004, P 12 ANN ACM INT C M, DOI DOI 10.1145/1027527.1027730
[9]  
[Anonymous], P MULT ACM, DOI DOI 10.1145/1101149.1101167
[10]  
Brants T., 2003, P 26 ANN INT ACM SIG, P330, DOI DOI 10.1145/860435.860495