Whiteboard Video Summarization via Spatio-Temporal Conflict Minimization

被引:15
作者
Davila, Kenny [1 ]
Zanibbi, Richard [1 ]
机构
[1] Rochester Inst Technol, Dept Comp Sci, Rochester, NY 14623 USA
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
基金
美国国家科学基金会;
关键词
CONTENT EXTRACTION; LECTURE;
D O I
10.1109/ICDAR.2017.66
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lecture videos are a valuable resource for students, and thanks to online sources they have become widely available. The ability to find videos based on their content could make them even more useful. Methods for automatic extraction of this content reduce the amount of manual effort required to make indexing and retrieval of such videos possible. We present a method that generates static image summaries of handwritten whiteboard content from lecture videos recorded with still cameras. We generate a spatio-temporal index for the handwritten content in the video, and we use it for temporal segmentation by detecting and removing conflicts between content regions. Resulting segments are used to produce key-frame based summaries. Our method has been tested on a video collection showing promising results for automatic lecture video summarization with good compression ratios and 96.28% recall of connected components in test videos.
引用
收藏
页码:355 / 362
页数:8
相关论文
共 36 条
[1]  
Adcock John., 2010, Proceedings of the international conference on Multimedia, MM '10, P241, DOI DOI 10.1145/1873951.1873986
[2]  
[Anonymous], 2012, P SIGCHI C HUM FACT, DOI [DOI 10.1145/2207676.2207767, 10.1145/2207676.22077672, DOI 10.1145/2207676.22077672]
[3]  
[Anonymous], P INT C FRONT HANDWR
[4]   Automatic Detection of Handwritten Texts from Video Frames of Lectures [J].
Banerjee, Purnendu ;
Bhattacharya, Ujjwal ;
Chaudhuri, Bidyut B. .
2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, :627-632
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Chang HS, 1999, IEEE T CIRC SYST VID, V9, P1269, DOI 10.1109/76.809161
[7]   Extracting content from instructional videos by statistical modelling and classification [J].
Choudary, Chekuri ;
Liu, Tiecheng .
PATTERN ANALYSIS AND APPLICATIONS, 2007, 10 (02) :69-81
[8]   Summarization of visual content in instructional videos [J].
Choudary, Chekuri ;
Liu, Tiecheng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (07) :1443-1455
[9]   Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[10]  
Davila K, 2013, WEST NY IMAGE PROCES, P14, DOI 10.1109/WNYIPW.2013.6890981