Semantic analysis for video contents extraction - Spotting by Association in news video

被引：29

作者：

Nakamura, Y ^{[1
]}

Kanade, T ^{[1
]}

机构：

[1] Univ Tsukuba, Inst Informat Sci & Elect, Tsukuba, Ibaraki 305, Japan

来源：

ACM MULTIMEDIA 97, PROCEEDINGS | 1997年

关键词：

D O I：

10.1145/266180.266391

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Spotting by Association method for video analysis is a novel method to detect video segments with typical semantics. Video data contains various kinds of information through continuous images, natural language, and sound. For videos to be stored and retrieved in a Digital Library, it is essential to segment the video data, into meaningful pieces. To detect meaningful segments, we need to identify the segment in each modality (video, language, and sound) that corresponds to the same story. For this purpose, we propose a new method for making correspondences between image clues detected by image analysis and language clues detected by natural language analysis. As a result, relevant video segments with sufficient information from every modality are obtained. We applied our method to closed-captioned CNN Headline News. Video segments with important events, such as a public speech, meeting, or visit, are detected fairly well.

引用

页码：393 / 401

页数：9