Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation

被引:7
|
作者
Nakatani, Tomohiro [1 ]
Ikeshita, Rintaro [1 ]
Kinoshita, Keisuke [1 ]
Sawada, Hiroshi [1 ]
Araki, Shoko [1 ]
机构
[1] NTT Corp, Tokyo, Japan
来源
关键词
Blind source separation; dereverberation; automatic speech recognition; INDEPENDENT COMPONENT ANALYSIS; MIXTURES;
D O I
10.21437/Interspeech.2020-2138
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper proposes new blind signal processing techniques for optimizing a multi-input multi-output (MIMO) convolutional beamformer (CBF) in a computationally efficient way to simultaneously perform dereverberation and source separation. For effective CBF optimization, a conventional technique factorizes it into a multiple-target weighted prediction error (WPE) based dereverberation filter and a separation matrix. However, this technique requires the calculation of a huge spatio-temporal covariance matrix that reflects the statistics of all the sources, which makes the computational cost very high. For computationally efficient optimization, this paper introduces two techniques: one that decomposes the huge covariance matrix into ones for individual sources, and another that decomposes the CBF into sub-filters for estimating individual sources. Both techniques effectively and substantively reduce the size of the covariance matrices that must calculated, and allow us to greatly reduce the computational cost without loss of optimality.
引用
收藏
页码:91 / 95
页数:5
相关论文
共 50 条
  • [31] A COMPUTATIONALLY CHEAPER METHOD FOR BLIND SPEECH SEPARATION BASED ON AUXIVA AND INCOMPLETE DEMIXING TRANSFORM
    Jansky, Jakub
    Koldovsky, Zbynek
    Ono, Nobutaka
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [32] A Novel Approach for Blind Separation and Dereverberation of Speech Mixtures using Multiple step Linear Predictive Coding
    Ehsan, Wajeeha
    Jan, Tariqullah
    2015 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET), 2015,
  • [33] An Efficient Analytic Solution for Joint Blind Source Separation
    Gabrielson, Ben
    Baker Siddique Akhonda, Mohammad Abu
    Lehmann, Isabell
    Adali, Tulay
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 2436 - 2449
  • [34] Blind Separation of the Joint Algorithm Based on the Cyclostationarity Optimization
    Jing-hong, Xue
    Min, Li
    2016 IEEE INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION AND COMMUNICATION TECHNOLOGY ICEICT 2016 PROCEEDINGS, 2016, : 266 - 270
  • [35] A convolutive blind signal separation method for joint speech signal separation and echo cancellation
    Pan, QF
    Aboulnasr, T
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 752 - 755
  • [36] A stochastic framework for computationally efficient fail-safe topology optimization
    Zhang, Yiming
    Zhang, Hongyi
    Qiu, Lemiao
    Wang, Zili
    Zhang, Shuyou
    Qiu, Na
    Fang, Jianguang
    ENGINEERING STRUCTURES, 2023, 283
  • [37] Computationally-Efficient Overdetermined Blind Source Separation Based on Iterative Source Steering
    Du, Yicheng
    Scheibler, Robin
    Togami, Masahito
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 927 - 931
  • [38] Computationally Efficient Blind Source Separation-Based MIMO-PA Linearization
    Shipra
    Rawat, Meenakshi
    IEEE ACCESS, 2024, 12 : 3126 - 3139
  • [39] Joint dereverberation and blind source separation using a hybrid autoregressive and convolutive transfer function-based model
    Liu, Shengdong
    Yang, Feiran
    Chen, Rilin
    Yang, Jun
    APPLIED ACOUSTICS, 2024, 224
  • [40] BLIND SPEECH SEPARATION EMPLOYING DIRECTIONAL STATISTICS IN AN EXPECTATION MAXIMIZATION FRAMEWORK
    Dang Hai Tran Vu
    Haeb-Umbach, Reinhold
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 241 - 244