A robust method for estimating synchronization and delay of audio and video for communication services

Authors
Andreas Rossholm
Benny Lövström
Affiliation
[1] Blekinge Institute of Technology,
Source
Multimedia Tools and Applications | 2016 / Vol. 75
Keywords
Lip sync; Synchronization; Delay; QoE; Video streaming; Video conferencing
Abstract
Synchronization is one of the main contributors to the quality of experience in streaming services and in two-way audio and video communication applications. Several studies and experiments have shown this, but methods for measuring synchronization are less common, especially methods that work without internal access to the application and independently of platform and device. In this paper we present a method for measuring synchronization skewness as well as delay for audio and video. The solution uses audio and video reference streams in which the audio and video frames are marked with frame numbers; these are decoded on the receiver side to enable calculation of synchronization and delay. The method has been verified in a two-way communication application in a transparent network, with and without inserted known delays, as well as in a network with 5% and 10% packet loss. The method can be used for both streaming and two-way communication services, with or without access to the internal structures, and enables measurements of applications running on, e.g., smartphones, tablets, and laptops under various conditions.
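The abstract gives no implementation details, so the following is only a minimal sketch of how decoded frame numbers could be turned into delay and skew figures. It assumes (these are illustrative assumptions, not statements from the paper) that each stream has a known constant frame rate, that the reference streams start at a known time, and that sender and receiver timestamps share one clock. The names `Observation`, `one_way_delay`, and `av_skew` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    """One marked frame as seen on the receiver side (hypothetical type)."""
    frame_number: int   # frame number decoded from the marker
    received_at: float  # receive time in seconds, on a clock shared with the sender


def media_time(frame_number: int, frame_rate: float) -> float:
    """Position of a frame in the reference stream, in seconds."""
    return frame_number / frame_rate


def one_way_delay(obs: Observation, frame_rate: float, stream_start: float) -> float:
    """Delay = time the frame was received minus the time it was sent.

    Assumes the reference stream started at `stream_start` and ran at a
    constant `frame_rate`, so the send time follows from the frame number.
    """
    sent_at = stream_start + media_time(obs.frame_number, frame_rate)
    return obs.received_at - sent_at


def av_skew(audio: Observation, video: Observation,
            audio_rate: float, video_rate: float) -> float:
    """Audio/video skew in seconds; positive means video lags behind audio.

    Equivalent to video delay minus audio delay, so the stream start time
    cancels out and is not needed.
    """
    media_gap = media_time(audio.frame_number, audio_rate) - media_time(video.frame_number, video_rate)
    receive_gap = audio.received_at - video.received_at
    return media_gap - receive_gap


# Illustrative values only: 20 ms audio frames (50 per second), 25 fps video.
audio_obs = Observation(frame_number=50, received_at=1.10)
video_obs = Observation(frame_number=25, received_at=1.18)

audio_delay = one_way_delay(audio_obs, frame_rate=50.0, stream_start=0.0)
video_delay = one_way_delay(video_obs, frame_rate=25.0, stream_start=0.0)
skew = av_skew(audio_obs, video_obs, audio_rate=50.0, video_rate=25.0)
```

In this sketch the skew is computed from the two observations alone, which is why it does not depend on knowing when the stream started; only the absolute delay needs that reference point.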
Pages: 527-545
Page count: 18