Summarization of visual content in instructional videos

被引:38
作者
Choudary, Chekuri [1 ]
Liu, Tiecheng
机构
[1] Univ S Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] Univ So Calif, Inst Informat Sci, Arlington, VA USA
关键词
E-1; earning; instructional video analysis; key frame selection;
D O I
10.1109/TMM.2007.906602
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In instructional videos of chalk board presentations, the visual content refers to the text and figures written on the boards. Existing methods on video summarization are not effective for this video domain because they are mainly based on low-level image features such as color and edges. In this work, we present a novel approach to summarizing the visual content in instructional videos using middle-level features. We first develop a robust algorithm to extract content text and figures from instructional videos by statistical modelling and clustering. This algorithm addresses the image noise, nonuniformity of the board regions, camera movements, occlusions, and other challenges in the instructional videos that are recorded in real classrooms. Using the extracted text and figures as the middle level features, we retrieve a set of key frames that contain most of the visual content. We further reduce content redundancy and build a mosaicked summary image by matching extracted content based on K-th Hausdorff distance and connected component decomposition. Performance evaluation on four full-length instructional videos shows that our algorithm is highly effective in summarizing instructional video content.
引用
收藏
页码:1443 / 1455
页数:13
相关论文
共 50 条
[1]  
ALTMAN E, 2002, ACM MULTIMEDIA, P416
[2]   Automatic key frame selection using a wavelet based approach [J].
Campisi, P ;
Longari, A ;
Neri, A .
WAVELET APPLICATIONS IN SIGNAL AND IMAGE PROCESSING VII, 1999, 3813 :861-872
[3]  
Chang HS, 1999, IEEE T CIRC SYST VID, V9, P1269, DOI 10.1109/76.809161
[4]  
Chen Y, 2003, PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, P568
[5]   Extracting content from instructional videos by statistical modelling and classification [J].
Choudary, Chekuri ;
Liu, Tiecheng .
PATTERN ANALYSIS AND APPLICATIONS, 2007, 10 (02) :69-81
[6]   Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[7]  
DeMenthon D., 1998, Proceedings ACM Multimedia 98, P211, DOI 10.1145/290747.290773
[8]  
Divakaran A, 2002, IEEE IMAGE PROC, P932
[9]  
DORAI C, 2003, P ICIP 2003, V3, P1029
[10]  
Erol B., 2005, IEEE INT C MULT EXP