AUDIO SYNCHRONISATION WITH A TUNNEL MATRIX FOR TIME SERIES AND DYNAMIC PROGRAMMING

被引:0
作者
Gorisch, Jan [1 ,2 ,3 ]
Prevot, Laurent [1 ,2 ]
机构
[1] Aix Marseille Univ, Marseille, France
[2] CNRS, Lab Parole & Langage, Paris, France
[3] Nanyang Technol Univ, Div Linguist & Multilingual Studies, Singapore, Singapore
来源
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015年
关键词
Audio-video Synchronisation; Image-loss Compensation; Tunnel Matrix; Tunnel DP-algorithm; Storage Requirements; RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Precise multimodal studies require precise synchronisation between audio and video signals. However, raw audio and audio from video recordings can be out of sync for several reasons. In order to re-synchronise them, a dynamic programming (DP) approach is presented here. Traditionally, DP is performed on the rectangular distance matrix comparing each value in signal A with each value in signal B. Previous work limited the search space using for example the Sakoe Chiba Band (Sakoe and Chiba, 1978). However, the overall space of the distance matrix remains identical. Here, a tunnel matrix and its according DP-algorithm are presented. The matrix contains merely the computed distance of two signals to a pre-specified bandwidth and the computational cost is equally reduced. An example implementation demonstrates the functionality on artificial data and on data from real audio and video recordings.
引用
收藏
页码:3846 / 3850
页数:5
相关论文
共 20 条
  • [1] Aimetti Guillaume, 2010, P INT 2010 MAK JAP
  • [2] [Anonymous], THESIS
  • [3] [Anonymous], 2007, P 16 INT C PHON SCI
  • [4] BERTRAND R, 2008, TRAITEMENT AUTOMATIQ, V49, P3
  • [5] Bertrand Roxane, 2007, P INT C AUD VIS SPEE
  • [6] Edlund Jens, 2010, P 7 INT C LANG RES E
  • [7] SPARSE MATRICES IN MATLAB - DESIGN AND IMPLEMENTATION
    GILBERT, JR
    MOLER, C
    SCHREIBER, R
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1992, 13 (01) : 333 - 356
  • [8] Pitch Contour Matching and Interactional Alignment across Turns: An Acoustic Investigation
    Gorisch, Jan
    Wells, Bill
    Brown, Guy J.
    [J]. LANGUAGE AND SPEECH, 2012, 55 : 57 - 76
  • [9] Gorisch Jan, 2014, P 9 INT C LANG RES E
  • [10] Gorisch Jan, 2012, THESIS