Information-theoretic temporal segmentation of video and applications: multiscale keyframes selection and shot boundaries detection

被引:15
作者
Janvier, Bruno [1 ]
Bruno, Eric [1 ]
Pun, Thierry [1 ]
Marchand-Maillet, Stephane [1 ]
机构
[1] Univ Geneva, Comp Vis & Multimedia Lab, Viper Grp, Geneva, Switzerland
关键词
content-based video analysis; temporal segmentation; keyframe selection; detection of shot boundaries;
D O I
10.1007/s11042-006-0026-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The first step in the analysis of video content is the partitioning of a long video sequence into short homogeneous temporal segments. The homogeneity property ensures that the segments are taken by a single camera and represent a continuous action in time and space. These segments can then be used as atomic temporal components for higher level analysis like browsing, classification, indexing and retrieval. The novelty of our approach is to use color information to partition the video into segments dynamically homogeneous using a criterion inspired by compact coding theory. We perform an information-based segmentation using a Minimum Message Length (MML) criterion and minimization by a Dynamic Programming Algorithm (DPA). We show that our method is efficient and robust to detect all types of transitions in a generic manner. A specific detector for each type of transition of interest therefore becomes unnecessary. We illustrate our technique by two applications: a multiscale keyframe selection and a generic shot boundaries detection.
引用
收藏
页码:273 / 288
页数:16
相关论文
共 12 条
  • [1] [Anonymous], P TRECVID 2003 WORKS
  • [2] BAXTER RA, 1994, P 4 IEEE DAT COMPR C
  • [3] FISHER W, 1958, J AM STAT ASS, V53
  • [4] FITZGIBBON OJ, 2000, P 11 INT C ALG LEARN, P56
  • [5] Performance characterization and comparison of video indexing
    Gargi, U
    Kasturi, R
    Antani, S
    [J]. 1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, : 559 - 565
  • [6] GUIGUES L, 2003, 19 C TRAIT SIGN IM G
  • [7] HANJALIC A, 2002, IEEE T CIRCUITS SYST, V12
  • [8] Shot boundary refinement for long transition in digital video sequence
    Heng, WJ
    Ngan, KN
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (04) : 434 - 445
  • [9] Temporal video segmentation: A survey
    Koprinska, I
    Carrato, S
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2001, 16 (05) : 477 - 500
  • [10] LIENHART R, 1999, 7 P SPIE