Non-Uniform Microphone Arrays for Robust Speech Source Localization for Smartphone-Assisted Hearing Aid Devices

被引:5
作者
Ganguly, Anshuman [1 ]
Panahi, Issa [1 ]
机构
[1] Univ Texas Dallas, Dept Elect Engn, Richardson, TX 75080 USA
来源
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2018年 / 90卷 / 10期
基金
美国国家卫生研究院;
关键词
Non-uniform microphone arrays; Speech source localization; Hearing aid devices; Smartphone; Low SNR;
D O I
10.1007/s11265-017-1297-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robust speech source localization (SSL) is an important component of the speech processing pipeline for hearing aid devices (HADs). SSL via time direction of arrival (TDOA) estimation has been known to improve performance of HADs in noisy environments, thereby providing better listening experience for hearing aid users. Smartphones now possess the capability to connect to the HADs through wired or wireless channel. In this paper, we present our findings about the non-uniform non-linear microphone array (NUNLA) geometry for improving SSL for HADs using an L-shaped three-element microphone array available on modern smartphones. The proposed method is implemented on a frame-based TDOA estimation algorithm using a modified Dictionary-based singular value decomposition method (SVD) method for localizing single speech sources under very low signal to noise ratios (SNR). Unlike most methods developed for uniform microphone arrays, the proposed method has low spatial aliasing as well as low spatial ambiguity while providing a robust low-error with 360A degrees DOA scanning capability. We present the comparison among different types of microphone arrays, as well as compare their performance using the proposed method.
引用
收藏
页码:1415 / 1435
页数:21
相关论文
共 32 条
  • [1] [Anonymous], 2007, ACOUSTIC SOURCE LOCA
  • [2] [Anonymous], 2015, PHONAK ADV BIONICS
  • [3] Brandstein M., 2013, Microphone Arrays: Signal Processing Techniques and Applications
  • [4] Brandstein M, 1995, THESIS
  • [5] Byrne D, 1998, Trends Amplif, V3, P51, DOI 10.1177/108471389800300202
  • [6] Efficient maximum likelihood DOA estimation for signals with known waveforms in the presence of multipath
    Cedervall, M
    Moses, RL
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (03) : 808 - 811
  • [7] On Spatial Aliasing in Microphone Arrays
    Dmochowski, Jacek
    Benesty, Jacob
    Affes, Sofiene
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (04) : 1383 - 1395
  • [8] A generalized steered response power method for computationally viable source localization
    Dmochowski, Jacek P.
    Benesty, Jacob
    Affes, Sofiene
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2510 - 2526
  • [9] Improving Sound Localization for Hearing Aid Devices using Smartphone Assisted Technology
    Ganguly, Anshuman
    Reddy, Chandan
    Hao, Yiya
    Panahi, Issa
    [J]. 2016 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2016, : 165 - 170
  • [10] Kamiyanagida H, 2001, AC SPEECH SIGN PROC, V5