Binaural Sound Source Distance Learning in Rooms

被引:45
作者
Vesa, Sampo [1 ]
机构
[1] Aalto Univ, Dept Media Technol, FI-02015 Espoo, Finland
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2009年 / 17卷 / 08期
基金
芬兰科学院;
关键词
Binaural signal; coherence; distance measurement; localization; CROSS-CORRELATION MODEL; CONTRALATERAL INHIBITION; PERCEPTION; LOCALIZATION; EXTENSION;
D O I
10.1109/TASL.2009.2022001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A method for learning the distance of a sound source in a room is presented. The proposed method is based on short-time magnitude-squared coherence between the two channels of a binaural signal. Based on white noise as the training data, a coherence profile is obtained at each desired position in the room. These profiles can then be used to identify the most likely distance of a speech signal in the same room. The proposed approach is compared to a previous method for learning the position of a sound source. The results indicate that the both methods are able to identify the distance of a speech sound source correctly in a grid with 0.5-m spacing in most cases, when the orientation of the listener is 0 degrees, 30 degrees, 60 degrees, 90 degrees, or 180 degrees on the horizontal plane.
引用
收藏
页码:1498 / 1507
页数:10
相关论文
共 38 条
[1]   MULTI-MICROPHONE SIGNAL-PROCESSING TECHNIQUE TO REMOVE ROOM REVERBERATION FROM SPEECH SIGNALS [J].
ALLEN, JB ;
BERKLEY, DA ;
BLAUERT, J .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 (04) :912-915
[2]  
[Anonymous], 2007, APPL SIGN PROC AUD A
[3]  
Backman J., 1993, ICASSP-93. 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing (Cat. No.92CH3252-4), P125, DOI 10.1109/ICASSP.1993.319071
[4]  
Bronkhorst A. W., 2002, P FOR AC SEV SPAIN S
[5]   Auditory distance perception in rooms [J].
Bronkhorst, AW ;
Houtgast, T .
NATURE, 1999, 397 (6719) :517-520
[6]   The effects of production and presentation level on the auditory distance perception of speech [J].
Brungart, DS ;
Scott, KR .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (01) :425-440
[7]   BREAKDOWN OF ECHO SUPPRESSION IN THE PRECEDENCE EFFECT [J].
CLIFTON, RK .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 82 (05) :1834-1835
[8]   Source localization in complex listening situations: Selection of binaural cues based on interaural coherence [J].
Faller, C ;
Merimaa, J .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (05) :3075-3089
[9]   COMBINED EVALUATION OF INTERAURAL TIME AND INTENSITY DIFFERENCES - PSYCHOACOUSTIC RESULTS AND COMPUTER MODELING [J].
GAIK, W .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1993, 94 (01) :98-110
[10]  
Griesinger D., 2004, P INT C AC KYOT JAP, V1, P29