Whiteboard Video Summarization via Spatio-Temporal Conflict Minimization

Cited by: 15

Authors:

Davila, Kenny [1]

Zanibbi, Richard [1]

Affiliations:

[1] Rochester Inst Technol, Dept Comp Sci, Rochester, NY 14623 USA

Source:

2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017

Funding:

National Science Foundation (USA);

Keywords:

CONTENT EXTRACTION; LECTURE;

DOI:

10.1109/ICDAR.2017.66

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Lecture videos are a valuable resource for students, and thanks to online sources they have become widely available. The ability to find videos based on their content could make them even more useful. Methods for automatic extraction of this content reduce the amount of manual effort required to make indexing and retrieval of such videos possible. We present a method that generates static image summaries of handwritten whiteboard content from lecture videos recorded with still cameras. We generate a spatio-temporal index for the handwritten content in the video, and we use it for temporal segmentation by detecting and removing conflicts between content regions. The resulting segments are used to produce key-frame based summaries. Our method has been tested on a video collection, showing promising results for automatic lecture video summarization with good compression ratios and 96.28% recall of connected components in test videos.

Citation:

Pages: 355-362

Page count: 8

References (36 in total):

[1] Adcock, John, 2010, Proceedings of the international conference on Multimedia, MM '10, P241, DOI 10.1145/1873951.1873986

[2] [Anonymous], 2012, P SIGCHI C HUM FACT, DOI 10.1145/2207676.2207767

[3] [Anonymous], P INT C FRONT HANDWR

[4] Automatic Detection of Handwritten Texts from Video Frames of Lectures [J].