Joint Dereverberation and Beamforming With Blind Estimation of the Shape Parameter of the Desired Source Prior

Cited by: 1
Authors
Yadav, Shekhar Kumar [1]
George, Nithin V. [1]
Affiliations
[1] Indian Inst Technol Gandhinagar, Dept Elect Engn, Palaj 382355, India
Keywords
Microphone array; dereverberation; acoustic beamforming; Student's t-distribution; speech dereverberation; maximum likelihood; cancellation; reverberant; quality
DOI
10.1109/TASLP.2023.3335000
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Dereverberation and acoustic beamforming are used to capture the speech of a desired speaker in the presence of interfering speakers in a reverberant room using an array of microphones. Traditionally, to perform these two tasks, the desired speech is modelled in the time-frequency domain using a complex Gaussian (CG) prior with time-varying variances. The shape parameter of the prior distribution is fixed at the same value for all time-frequency bins. In this work, we propose to model the inverse of the variance (i.e., the precision parameter) of the CG prior distribution, which controls the shape of the distribution, as a Gamma-distributed random variable. The hyperparameters of the Gamma distribution are then estimated from the data captured by the microphones. This data-dependent blind estimation of the shape of the prior distribution helps the proposed algorithm model the desired speech accurately and adapt to different speakers and acoustic scenarios better than algorithms with a fixed shape parameter. We use maximum likelihood techniques to estimate the multi-channel linear prediction (MCLP) dereverberation coefficients and the beamforming weights under the proposed signal model. The latent stochastic precision parameters are obtained by estimating the hyperparameters using the expectation-maximization (EM) method. For the online version of the algorithm, a recursive EM method is also proposed for real-time processing. Extensive simulation results show improved dereverberation and interference cancellation performance for the proposed method, highlighting the importance of not choosing the shape parameter of the prior distribution manually.
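
The modelling idea in the abstract can be illustrated with a minimal scalar sketch. It assumes a single time-frequency coefficient x with a circular complex Gaussian likelihood of precision lambda, and a Gamma prior on lambda with shape a and rate b. Under these assumptions, integrating out lambda gives a heavy-tailed complex Student's t density, and the posterior of lambda given x is again a Gamma distribution, which is the quantity an EM E-step would use. The function names and the (a, b) parameterization are hypothetical and chosen for illustration; they are not taken from the paper, and the sketch does not cover the MCLP or beamforming updates.

```python
import math
import random


def posterior_precision_mean(x_abs2, a, b):
    """E[lambda | x] when x | lambda is circular complex Gaussian with
    precision lambda and lambda ~ Gamma(shape=a, rate=b) (assumed model).
    The posterior is Gamma(a + 1, b + |x|^2)."""
    return (a + 1.0) / (b + x_abs2)


def student_t_pdf(x_abs2, a, b):
    """Marginal density of x over the complex plane after integrating
    out the Gamma-distributed precision (complex Student's t form)."""
    return (a / (math.pi * b)) * (1.0 + x_abs2 / b) ** (-(a + 1.0))


def tail_probability(t, a, b):
    """P(|x|^2 > t) under the marginal above."""
    return (1.0 + t / b) ** (-a)


def sample_scale_mixture(n, a, b, rng):
    """Draw n samples by first drawing a precision, then a complex Gaussian."""
    samples = []
    for _ in range(n):
        # random.gammavariate takes (shape, scale), so scale = 1 / rate.
        lam = rng.gammavariate(a, 1.0 / b)
        # Each real component has variance 1 / (2 * lam), so E|x|^2 = 1 / lam.
        sigma = math.sqrt(0.5 / lam)
        samples.append(complex(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma)))
    return samples


if __name__ == "__main__":
    rng = random.Random(0)
    a, b = 2.0, 1.0
    xs = sample_scale_mixture(20000, a, b, rng)
    empirical = sum(1 for x in xs if abs(x) ** 2 > 3.0) / len(xs)
    print("empirical tail:", empirical)
    print("closed-form tail:", tail_probability(3.0, a, b))
```

A small shape value a gives heavier tails, while a large a (with b scaled accordingly) approaches a Gaussian, which is why estimating these hyperparameters from data, rather than fixing them, changes how strongly large-magnitude coefficients are down-weighted.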
Pages: 779-793
Number of pages: 15