SIGNAL-INFORMED DNN-BASED DOA ESTIMATION COMBINING AN EXTERNAL MICROPHONE AND GCC-PHAT FEATURES

被引:3
|
作者
Kowalk, Ulrik [1 ]
Doclo, Simon [2 ,3 ]
Bitzer, Joerg [1 ]
机构
[1] Jade Univ Appl Sci, Inst Hearing Technol & Audiol, Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Oldenburg, Germany
[3] Carl von Ossietzky Univ Oldenburg, Cluster Excellence Hearing4all, Oldenburg, Germany
来源
2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022) | 2022年
关键词
signal-informed; source localization; GCC-PHAT; binary masking; external microphone; LOCALIZATION;
D O I
10.1109/IWAENC53105.2022.9914754
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Aiming at estimating the direction of arrival (DOA) of a desired speaker in a multi-talker environment using a microphone array, in this paper we propose a signal-informed method exploiting the availability of an external microphone attached to the desired speaker. The proposed method applies a binary mask to the GCC-PHAT input features of a convolutional neural network, where the binary mask is computed based on the power distribution of the external microphone signal. Experimental results for a reverberant scenario with up to four interfering speakers demonstrate that the signal-informed masking improves the localization accuracy, without requiring any knowledge about the interfering speakers.
引用
收藏
页数:5
相关论文
共 2 条
  • [1] Time Delay Estimation for Speaker Localization Using CNN-Based Parametrized GCC-PHAT Features
    Salvati, Daniele
    Drioli, Carlo
    Foresti, Gian Luca
    INTERSPEECH 2021, 2021, : 1479 - 1483
  • [2] Azimuth Estimation based on Generalized Cross Correlation Phase Transform (GCC-PHAT) using Equilateral Triangle Microphone Array
    Adritya, Catur Hilman H. B. B.
    Saputra, Hendri Maja
    2019 INTERNATIONAL CONFERENCE ON RADAR, ANTENNA, MICROWAVE, ELECTRONICS, AND TELECOMMUNICATIONS (ICRAMET), 2019, : 89 - 93