QuickCut: An Interactive Tool for Editing Narrated Video

被引:44
作者
Anh Truong [1 ]
Berthouzoz, Floraine [1 ]
Li, Wilmot [1 ]
Agrawala, Maneesh [2 ]
机构
[1] Adobe Res, San Jose, CA 95110 USA
[2] Stanford Univ, Stanford, CA 94305 USA
来源
UIST 2016: PROCEEDINGS OF THE 29TH ANNUAL SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY | 2016年
关键词
video editing; video annotation; video segmentation;
D O I
10.1145/2984511.2984569
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present QuickCut, an interactive video editing tool designed to help authors efficiently edit narrated video. QuickCut takes an audio recording of the narration voiceover and a collection of raw video footage as input. Users then review the raw footage and provide spoken annotations describing the relevant actions and objects in the scene. QuickCut time-aligns a transcript of the annotations with the raw footage and a transcript of the narration to the voiceover. These aligned transcripts enable authors to quickly match story events in the narration with semantically relevant video segments and form alignment constraints between them. Given a set of such constraints, QuickCut applies dynamic programming optimization to choose frame-level cut points between the video segments while maintaining alignments with the narration and adhering to low-level film editing guidelines. We demonstrate QuickCut's effectiveness by using it to generate six (<= 2 minutes) narrated videos. Each result required between 14 and 52 minutes of user time to edit (i.e. between 8 and 31 minutes for each minute of output video), which is far less than typical authoring times with existing video editing workflows.
引用
收藏
页码:497 / 507
页数:11
相关论文
共 24 条
  • [1] [Anonymous], 2015, ARXIV150606724
  • [2] [Anonymous], 2014, UIST, DOI DOI 10.1145/2642918.2647400
  • [3] [Anonymous], 2008, Introduction to information retrieval
  • [4] Automatic Editing of Footage from Multiple Social Cameras
    Arev, Ido
    Park, Hyun Soo
    Sheikh, Yaser
    Hodgins, Jessica
    Shamir, Ariel
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (04):
  • [5] Bao Xuan., 2010, PROC MOBISYS 2010, P357, DOI DOI 10.1145/1814433.1814468
  • [6] Tools for Placing Cuts and Transitions in Interview Video
    Berthouzoz, Floraine
    Li, Wilmot
    Agrawala, Maneesh
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04):
  • [7] BIRD S, 2006, P COLING ACL INT PRE, P69, DOI DOI 10.3115/1225403.1225421
  • [8] Chi P.-Y., 2013, P 26 ANN ACM S US IN, P141, DOI DOI 10.1145/2501988.2502052
  • [9] Cour T, 2008, LECT NOTES COMPUT SC, V5305, P158, DOI 10.1007/978-3-540-88693-8_12
  • [10] RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY
    FISCHLER, MA
    BOLLES, RC
    [J]. COMMUNICATIONS OF THE ACM, 1981, 24 (06) : 381 - 395