Blind compensation of interchannel sampling frequency mismatch for ad hoc microphone array based on maximum likelihood estimation

被引:50
作者
Miyabe, Shigeki [1 ]
Ono, Nobutaka [2 ]
Makino, Shoji [1 ]
机构
[1] Univ Tsukuba, Tsukuba Adv Res Alliance, Life Sci Ctr, Tsukuba, Ibaraki 3058573, Japan
[2] Grad Univ Adv Studies SOKENDAI, Dept Informat,Sch Multidisciplinary Sci, Principles Informat Res Div,Natl Inst Informat, Chiyoda Ku, Tokyo 1018430, Japan
关键词
Ad hoc microphone array; Drift; Sampling frequency; Maximum likelihood estimation; Blind source separation;
D O I
10.1016/j.sigpro.2014.09.015
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a novel method for the blind compensation of drift for the asynchronous recording of an ad hoc microphone array. Digital signals simultaneously observed by different recording devices have drift of the time differences between the observation channels because of the sampling frequency mismatch among the devices. On the basis of a model in which the time difference is constant within each short time frame but varies in proportion to the central time of the frame, the effect of the sampling frequency mismatch can be compensated in the short-time Fourier transform (STFT) domain by a linear phase shift. By assuming that the sources are motionless and have stationary amplitudes, the observation is regarded as being stationary when drift does not occur. Thus, we formulate a likelihood to evaluate the stationarity in the STFT domain to evaluate the compensation of drift. The maximum likelihood estimation is obtained effectively by a golden section search. Using the estimated parameters, we compensate the drift by STFT analysis with a noninteger frame shift. The effectiveness of the proposed blind drift compensation method is evaluated in an experiment in which artificial drift is generated. (C) 2014 The Authors. Published by Elsevier B.V.
引用
收藏
页码:185 / 196
页数:12
相关论文
共 22 条
[1]  
[Anonymous], P IWAENC
[2]  
Bertrand A., 2011, P SCVT 2011
[3]  
Brandstein References M., 2001, MICROPHONE ARRAYS SI
[4]   Blind Estimation of Locations and Time Offsets for Distributed Recording Devices [J].
Hasegawa, Keisuke ;
Ono, Nobutaka ;
Miyabe, Shigeki ;
Sagayama, Shigeki .
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, 2010, 6365 :57-64
[5]  
Hoflinger F, 2012, P IPIN 2012
[6]  
Janson T., 2012, P IPIN 2012
[7]  
Javidi S., 2011, FRONT NEUROSCI, V5, P25
[8]   ATR JAPANESE SPEECH DATABASE AS A TOOL OF SPEECH RECOGNITION AND SYNTHESIS [J].
KUREMATSU, A ;
TAKEDA, K ;
SAGISAKA, Y ;
KATAGIRI, S ;
KUWABARA, H ;
SHIKANO, K .
SPEECH COMMUNICATION, 1990, 9 (04) :357-363
[9]  
Lienhart R., 2003, P ICASSP, P840
[10]  
Liu Z., 2007, P IWAENC 2007