A UNIVERSAL DEEP ROOM ACOUSTICS ESTIMATOR

被引:8
作者
Lopez, Paula Sanchez [1 ,3 ]
Callens, Paul [1 ]
Cernak, Milos [2 ]
机构
[1] Ecole Polytech Fed Lausanne, LTS2, Stn 11, CH-1015 Lausanne, Switzerland
[2] Logitech Europe SA, CH-1015 Lausanne, Switzerland
[3] Logitech, Lausanne, Switzerland
来源
2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2021年
关键词
Room acoustics; Convolutional Recurrent Neural Network; RT60; C50; DRR; STI; SNR;
D O I
10.1109/WASPAA52581.2021.9632738
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech audio quality is subject to degradation caused by an acoustic environment and isotropic ambient and point noises. The environment can lead to decreased speech intelligibility and loss of focus and attention by the listener. Basic acoustic parameters that characterize the environment well are (i) signal-to-noise ratio (SNR), (ii) speech transmission index, (iii) reverberation time, (iv) clarity, and (v) direct-to-reverberant ratio. Except for the SNR, these parameters are usually derived from the Room Impulse Response (RIR) measurements; however, such measurements are often not available. This work presents a universal room acoustic estimator design based on convolutional recurrent neural networks that estimate the acoustic environment measurement blindly and jointly. Our results indicate that the proposed system is robust to non-stationary signal variations and outperforms current state-of-the-art methods.
引用
收藏
页码:356 / 360
页数:5
相关论文
共 26 条
[1]  
[Anonymous], 2005, SYNTHESIS LECT SPEEC
[2]  
[Anonymous], 2009, 338212009 BS EN ISO
[3]  
Callens P., 2020, ARXIV201011167
[4]  
Clyburne-Sherin A., 2019, METAPSYCHOLOGY, V3, P892
[5]   Speech Characteristics and Intelligibility in Adults with Mild and Moderate Intellectual Disabilities [J].
Coppens-Hofman, Marjolein C. ;
Terband, Hayo ;
Snik, Ad F. M. ;
Maassen, Ben A. M. .
FOLIA PHONIATRICA ET LOGOPAEDICA, 2016, 68 (04) :175-182
[6]   On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement [J].
Dat, Tran Huy ;
Takeda, Kazuya ;
Itakura, Fumitada .
SPEECH COMMUNICATION, 2006, 48 (11) :1515-1527
[7]  
Donley J, 2017, SOUND ZONE TOOLS
[8]   Estimation of Room Acoustic Parameters: The ACE Challenge [J].
Eaton, James ;
Gaubitch, Nikolay D. ;
Moore, Alastair H. ;
Naylor, Patrick A. .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (10) :1681-1693
[9]  
Farina A, 2000, 108 AUD ENG SOC CONV
[10]  
Fletcher H, 1929, SPEECH AND HEARING