DeepEar: Sound Localization With Binaural Microphones

Cited by: 7
Authors
Yang, Qiang [1]
Zheng, Yuanqing [1]
Affiliation
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
Keywords
Binaural localization; multi-source localization; earable computing; NEURAL-NETWORKS; HEAD MOVEMENTS; NOISE; DIFFERENCE; FEATURES; SEARCH;
DOI
10.1109/TMC.2022.3222821
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The binaural microphone, a pair of microphones mounted in artificial human-shaped ears, is widely used in hearing aids and spatial audio recording to improve sound quality. In many applications, such as binaural sound enhancement, it is crucial for such devices to determine the direction of a voice. However, sound localization with two microphones remains challenging, especially in multi-source scenarios. Most previous work used microphone arrays to address the multi-source localization problem, yet extra microphones are subject to space constraints in many deployment scenarios (e.g., hearing aids). Inspired by the fact that humans have evolved to locate multiple sound sources with only two ears, we propose DeepEar, a binaural microphone-based sound localization system. To this end, we design a multisector-based neural network to locate multiple sound sources simultaneously, where each sector is a discretized region of space for different angles of arrival. DeepEar fuses explicit hand-crafted features and implicit latent sound representations to facilitate sound localization. More importantly, the trained DeepEar model can adapt to new environments with a minimal amount of extra training data. Experimental results show that DeepEar substantially outperforms the state-of-the-art binaural deep learning approach in terms of sound detection accuracy and azimuth estimation error.
Pages: 359-375
Page count: 17