JOINT MAXIMUM LIKELIHOOD ESTIMATION OF LATE REVERBERANT AND SPEECH POWER SPECTRAL DENSITY IN NOISY ENVIRONMENTS

被引:0
|
作者
Schwartz, Ofer [1 ]
Gannot, Sharon [1 ]
Habets, Emanueel A. P. [2 ]
机构
[1] Bar Ilan Univ, Fac Engn, IL-52900 Ramat Gan, Israel
[2] Int Audio Labs Erlangen, Wolfsmantel 33, D-91058 Erlangen, Germany
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An estimate of the power spectral density (PSD) of the late reverberation is often required by dereverberation algorithms. In this work, we derive a novel multichannel maximum likelihood (ML) estimator for the PSD of the reverberation that can be applied in noisy environments. Since the anechoic speech PSD is usually unknown in advance, it is estimated as well. As a closed-form solution for the maximum likelihood estimator is unavailable, a Newton method for maximizing the ML criterion is derived. Experimental results show that the proposed estimator provides an accurate estimate of the PSD, and outperforms competing estimators. Moreover, when used in a multi-microphone dereverberation and noise reduction algorithm, the best performance in terms of the log-spectral distance is achieved when employing the proposed PSD estimator.
引用
收藏
页码:151 / 155
页数:5
相关论文
共 50 条
  • [1] MAXIMUM LIKELIHOOD ESTIMATION OF THE LATE REVERBERANT POWER SPECTRAL DENSITY IN NOISY ENVIRONMENTS
    Schwartz, Ofer
    Braun, Sebastian
    Gannot, Sharon
    Habets, Emanuel A. P.
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [2] JOINT ESTIMATION OF LATE REVERBERANT AND SPEECH POWER SPECTRAL DENSITIES IN NOISY ENVIRONMENTS USING FROBENIUS NORM
    Schwartz, Ofer
    Gannot, Sharon
    Habets, Emanuel A. P.
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1123 - 1127
  • [3] MAXIMUM LIKELIHOOD PSD ESTIMATION FOR SPEECH ENHANCEMENT IN REVERBERANT AND NOISY CONDITIONS
    Kuklasinski, Adam
    Doclo, Simon
    Jensen, Jesper
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 599 - 603
  • [4] Maximum likelihood approach to speech enhancement for noisy reverberant signals
    Yoshioka, Takuya
    Nakatani, Tomohiro
    Hikichi, Takafumi
    Miyoshi, Masato
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4585 - 4588
  • [5] TDOA ESTIMATION OF SPEECH SOURCE IN NOISY REVERBERANT ENVIRONMENTS
    Bu, Suliang
    Zhao, Tuo
    Zhao, Yunxin
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 1059 - 1066
  • [6] MAXIMUM LIKELIHOOD ESTIMATION OF THE DIRECTION OF SOUND IN A REVERBERANT NOISY ENVIRONMENT
    Mansour, Mohamed F.
    2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 16 - 20
  • [7] LATE REVERBERANT POWER SPECTRAL DENSITY ESTIMATION BASED ON AN EIGENVALUE DECOMPOSITION
    Kodrasi, Ina
    Doclo, Simon
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 611 - 615
  • [8] Reverberation Time Estimation based on a Model for the Power Spectral Density of Reverberant Speech
    Faraji, Neda
    Ahadi, Seyed Mohammad
    Sheikhzadeh, Hamid
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1453 - 1457
  • [9] Enhancement of Reverberant Speech in Noisy Acoustical Environments
    Joorabchi, Marjan
    Ghorshi, Seyed
    Sarafnia, Ali
    2014 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2014,
  • [10] Speech Emotion Recognition in Noisy and Reverberant Environments
    Heracleous, Panikos
    Yasuda, Keiji
    Sugaya, Fumiaki
    Yoneyama, Akio
    Hashimoto, Masayuki
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 262 - 266