Direction of arrival estimation improvement of speech on a two-microphone array

被引:0
作者
Nor, Mohd Nadzrul Bin Mohd [1 ]
Matsumura, Tomoya [1 ]
Onoye, Takao [1 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Osaka, Japan
来源
PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING | 2007年
关键词
acoustic; speech processing; speech direction of arrival; vowel harmonic structure; and two-microphone array;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes methods to achieve improved accuracy of speech direction of arrival estimation. Previous research has proposed a high resolution DOA estimation system for human vowels using only two microphones. However, in real environment, the conventional DOA estimation system is not robust enough to provide accurate results for human speech. To increase the robustness of the system for speech, non-utterance frame omission method and steering frequency selection method are proposed. Non-utterance frame omission evaluates the strength of speech in each frame and omits frames that have no or weak speech presence. Steering frequency selection is applied to determine the frequency that is imperative for DOA estimation based on harmonic product spectrum. Finally, the proposed system is evaluated both through simulation and real environment test. Proposed system shows a distinct improvement for speech DOA estimation amounting to about 46% decrease in estimation error compared to the conventional system for sound sources present at the side of the array.
引用
收藏
页码:129 / 135
页数:7
相关论文
共 17 条
  • [1] FLANAGAN JL, 1985, P IEEE, V73, P732
  • [2] HIOKA Y, 2004, DENSHI JOHO GAKKAI R
  • [3] HIOKA YS, 2004, IEICE T FUNDAMENTAL
  • [4] JIAN WR, 2006, ADV DIRECTION ARRIVA
  • [5] KELLERMAN W, 1992, P ICASSP 92, P304
  • [6] KHALIL F, 1994, J AUDIO ENG SOC, V42, P691
  • [7] KIKUCHI T, 2004, DSP98164 IEICE
  • [8] LIN Q, 1996, P ICASSP 96 ATL GA, P21
  • [9] NAKADAI K, 2002, P IEEE INT C SPOK LA
  • [10] Omologo M, 2001, DIGITAL SIGNAL PROC, P331