Semantic analysis for video contents extraction - Spotting by Association in news video

被引:29
作者
Nakamura, Y [1 ]
Kanade, T [1 ]
机构
[1] Univ Tsukuba, Inst Informat Sci & Elect, Tsukuba, Ibaraki 305, Japan
来源
ACM MULTIMEDIA 97, PROCEEDINGS | 1997年
关键词
D O I
10.1145/266180.266391
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Spotting by Association method for video analysis is a novel method to detect video segments with typical semantics. Video data contains various kinds of information through continuous images, natural language, and sound. For videos to be stored and retrieved in a Digital Library, it is essential to segment the video data, into meaningful pieces. To detect meaningful segments, we need to identify the segment in each modality (video, language, and sound) that corresponds to the same story. For this purpose, we propose a new method for making correspondences between image clues detected by image analysis and language clues detected by natural language analysis. As a result, relevant video segments with sufficient information from every modality are obtained. We applied our method to closed-captioned CNN Headline News. Video segments with important events, such as a public speech, meeting, or visit, are detected fairly well.
引用
收藏
页码:393 / 401
页数:9
相关论文
共 8 条
  • [1] [Anonymous], INT J LEXICOGRAPHY
  • [2] [Anonymous], 1993, 3 INT WORKSH PARS TE
  • [3] HAUPTMANN A, 1995, IJCAI 95 WORKSH INT
  • [4] ROWLEY H, 1996, IM UND WORKSH
  • [5] SMITH M, 1997, IEEE CVPR
  • [6] SMITH MA, 1995, AAAI FALL 1995 S COM
  • [7] Wactlar H, 1996, IEEE COMPUTER, V29
  • [8] ZHANG HJ, 1995, P ACM MULTIMEDIA