Sound Source Distance Estimation in Rooms based on Statistical Properties of Binaural Signals

被引:29
|
作者
Georganti, Eleftheria [1 ]
May, Tobias [2 ]
van de Par, Steven [3 ]
Mourjopoulos, John [1 ]
机构
[1] Univ Patras, Audio & Acoust Technol Grp, Wire Commun Lab, Elect & Comp Engn Dept, Patras 26500, Greece
[2] Tech Univ Denmark, Ctr Appl Hearing Res, Dept Elect Engn, DK-2800 Lyngby, Denmark
[3] Carl von Ossietzky Univ Oldenburg, Inst Phys, D-26111 Oldenburg, Germany
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013年 / 21卷 / 08期
关键词
spectral standard deviation; Binaural distance estimation; room transfer functions; REVERBERANT ENERGY RATIO; COHERENCE;
D O I
10.1109/TASL.2013.2260155
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A novel method for the estimation of the distance of a sound source from binaural speech signals is proposed. The method relies on several statistical features extracted from such signals and their binaural cues. Firstly, the standard deviation of the difference of the magnitude spectra of the left and right binaural signals is used as a feature for this method. In addition, an extended set of additional statistical features that can improve distance detection is extracted from an auditory front-end which models the peripheral processing of the human auditory system. The method incorporates the above features into two classification frameworks based on Gaussian mixture models and Support Vector Machines and the relative merits of those frameworks are evaluated. The proposed method achieves distance detection when tested in various acoustical environments and performs well in unknown environments. Its performance is also compared to an existing binaural distance detection method.
引用
收藏
页码:1727 / 1741
页数:15
相关论文
共 50 条
  • [1] Binaural Sound Source Distance Learning in Rooms
    Vesa, Sampo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (08): : 1498 - 1507
  • [2] Sound source distance learning based on binaural signals
    Vesa, Sampo
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 9 - 12
  • [3] Binaural Sound Source Distance Estimation and Localization for a Moving Listener
    Krause, Daniel Aleksander
    Garcia-Barrios, Guillermo
    Politis, Archontis
    Mesaros, Annamaria
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 996 - 1011
  • [4] Binaural sound source localization in real and virtual rooms
    Laboratory of Acoustics and Thermal Physics, Katholieke Universiteit Leuven, 3001 Heverlee, Belgium
    不详
    不详
    AES J Audio Eng Soc, 2009, 4 (205-220):
  • [5] Binaural Sound Source Localization in Real and Virtual Rooms
    Rychtarikova, Monika
    Van den Bogaert, Tim
    Vermeir, Gerrit
    Wouters, Jan
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2009, 57 (04): : 205 - 220
  • [6] Room Volume Estimation Based on Statistical Properties of Binaural Signals Using Humanoid Robot
    Shimoyama, Ryuichi
    Fukuda, Reo
    2014 23RD INTERNATIONAL CONFERENCE ON ROBOTICS IN ALPE-ADRIA-DANUBE REGION (RAAD), 2014,
  • [7] Full Sound Source Localization of Binaural Signals
    Venkatesan, R.
    Ganesh, A. Balaji
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 213 - 217
  • [8] A study on distance estimation in binaural sound localization
    Rodemann, Tobias
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 425 - 430
  • [9] Analysis of Monaural and Binaural Statistical Properties for the Estimation of Distance of a Target Speaker
    Venkatesan, R.
    Ganesh, A. Balaji
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (07) : 3626 - 3651
  • [10] Analysis of Monaural and Binaural Statistical Properties for the Estimation of Distance of a Target Speaker
    R. Venkatesan
    A. Balaji Ganesh
    Circuits, Systems, and Signal Processing, 2020, 39 : 3626 - 3651