Automated video summarization using speech transcripts

Cited by: 0
Authors
Taskiran, CM [1]
Amir, A [1]
Ponceleon, D [1]
Delp, EJ [1]
Affiliations
[1] Purdue Univ, Sch Elect & Comp Engn, VIPER, Video & Image Proc Lab, W Lafayette, IN 47907 USA
Source
STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2002 | 2002 / Vol. 4676
Keywords
video summarization; speech analysis; video databases; content-based video analysis; video skimming; segmentation of speech transcripts; summary evaluation
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Compact representations of video data can enable efficient video browsing. Such representations provide the user with information about the content of the particular sequence being examined while preserving the essential message. We propose a method to automatically generate video summaries for long videos. Our video summarization approach involves two main tasks: first, segmenting the video into small, coherent segments, and second, ranking the resulting segments. Our proposed algorithm scores segments based on word frequency analysis of speech transcripts. A summary is then generated by selecting the segments with the highest score-to-duration ratios and concatenating them. We have designed and performed a user study to evaluate the quality of the generated summaries. Our proposed algorithm is compared with a random segment selection scheme based on statistical analysis of the user study results. Finally, we discuss various issues that arise in summary evaluation with user studies.
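The selection step described in the abstract (score each segment from its transcript words, then keep the segments with the highest score-to-duration ratios) can be illustrated with a minimal Python sketch. The abstract does not give the exact scoring formula, so the TF-IDF-style weight below is only a stand-in assumption, as are the `Segment` type, the function names, and the `budget` parameter (target summary length in seconds); none of these come from the paper.

```python
import math
from collections import Counter
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    """Hypothetical container for one coherent video segment."""
    start: float       # segment start time in seconds
    end: float         # segment end time in seconds
    words: List[str]   # transcript words spoken during the segment

    @property
    def duration(self) -> float:
        return self.end - self.start


def score_segments(segments: List[Segment]) -> List[float]:
    """Score segments with a TF-IDF-style weight (an assumed stand-in,
    not the scoring formula used in the paper)."""
    n = len(segments)
    doc_freq = Counter()
    for seg in segments:
        doc_freq.update(set(seg.words))  # count segments containing each word
    scores = []
    for seg in segments:
        term_freq = Counter(seg.words)
        scores.append(
            sum(count * math.log(n / doc_freq[word])
                for word, count in term_freq.items())
        )
    return scores


def summarize(segments: List[Segment], budget: float) -> List[Segment]:
    """Greedily pick segments by score-to-duration ratio until the
    time budget is used up, then return them in playback order."""
    scores = score_segments(segments)
    candidates = [i for i, seg in enumerate(segments) if seg.duration > 0]
    candidates.sort(key=lambda i: scores[i] / segments[i].duration,
                    reverse=True)
    chosen, used = [], 0.0
    for i in candidates:
        if used + segments[i].duration <= budget:
            chosen.append(i)
            used += segments[i].duration
    # Concatenate in original temporal order.
    return [segments[i] for i in sorted(chosen)]


if __name__ == "__main__":
    demo = [
        Segment(0, 10, ["intro", "the", "the"]),
        Segment(10, 20, ["video", "summary", "the"]),
        Segment(20, 40, ["the", "the", "the", "the"]),
    ]
    for seg in summarize(demo, budget=20):
        print(seg.start, seg.end, " ".join(seg.words))
```

Ranking by score per second rather than by raw score keeps long, low-information segments from crowding short, dense ones out of a fixed-length summary.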
Pages: 371-382
Page count: 12