Tools for Placing Cuts and Transitions in Interview Video

被引:50
作者
Berthouzoz, Floraine [1 ]
Li, Wilmot
Agrawala, Maneesh [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
ACM TRANSACTIONS ON GRAPHICS | 2012年 / 31卷 / 04期
基金
美国国家科学基金会;
关键词
Interaction; Human-Computer Interfaces;
D O I
10.1145/2185520.2185563
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a set of tools designed to help editors place cuts and create transitions in interview video. To help place cuts, our interface links a text transcript of the video to the corresponding locations in the raw footage. It also visualizes the suitability of cut locations by analyzing the audio/visual features of the raw footage to find frames where the speaker is relatively quiet and still. With these tools editors can directly highlight segments of text, check if the endpoints are suitable cut locations and if so, simply delete the text to make the edit. For each cut our system generates visible (e.g. jump-cut, fade, etc.) and seamless, hidden transitions. We present a hierarchical, graph-based algorithm for efficiently generating hidden transitions that considers visual features specific to interview footage. We also describe a new data-driven technique for setting the timing of the hidden transition. Finally, our tools offer a one click method for seamlessly removing 'ums' and repeated words as well as inserting natural-looking pauses to emphasize semantic content. We apply our tools to edit a variety of interviews and also show how they can be used to quickly compose multiple takes of an actor narrating a story.
引用
收藏
页数:8
相关论文
共 36 条
  • [1] Abel J., 1999, RADIO ILLUSTRATED GU
  • [2] Panoramic video textures
    Agarwala, A
    Zheng, KC
    Pal, C
    Agrawala, M
    Cohen, M
    Curless, B
    Salesin, D
    Szeliski, R
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03): : 821 - 827
  • [3] An optimal algorithm for approximate nearest neighbor searching in fixed dimensions
    Arya, S
    Mount, DM
    Netanyahu, NS
    Silverman, R
    Wu, AY
    [J]. JOURNAL OF THE ACM, 1998, 45 (06) : 891 - 923
  • [4] Athitsos V, 2004, PROC CVPR IEEE, P268
  • [5] A morphable model for the synthesis of 3D faces
    Blanz, V
    Vetter, T
    [J]. SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, : 187 - 194
  • [6] Comparison of video shot boundary detection techniques
    Boreczky, JS
    Rowe, LA
    [J]. JOURNAL OF ELECTRONIC IMAGING, 1996, 5 (02) : 122 - 128
  • [7] BREGLER C, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P494, DOI 10.1109/ICCV.1995.466899
  • [8] Bregler C., 1997, Computer Graphics Proceedings, SIGGRAPH 97, P353, DOI 10.1145/258734.258880
  • [9] High accuracy optical flow estimation based on a theory for warping
    Brox, T
    Bruhn, A
    Papenberg, N
    Weickert, J
    [J]. COMPUTER VISION - ECCV 2004, PT 4, 2004, 2034 : 25 - 36
  • [10] Casares Juan, 2002, P 4 C DESIGNING INTE, P157