Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation

被引：7

作者：

Nakatani, Tomohiro ^{[1
]}

Ikeshita, Rintaro ^{[1
]}

Kinoshita, Keisuke ^{[1
]}

Sawada, Hiroshi ^{[1
]}

Araki, Shoko ^{[1
]}

机构：

[1] NTT Corp, Tokyo, Japan

来源：

INTERSPEECH 2020 | 2020年

关键词：

Blind source separation; dereverberation; automatic speech recognition; INDEPENDENT COMPONENT ANALYSIS; MIXTURES;

D O I：

10.21437/Interspeech.2020-2138

中图分类号：

R36 [病理学]; R76 [耳鼻咽喉科学];

学科分类号：

100104 ; 100213 ;

摘要：

This paper proposes new blind signal processing techniques for optimizing a multi-input multi-output (MIMO) convolutional beamformer (CBF) in a computationally efficient way to simultaneously perform dereverberation and source separation. For effective CBF optimization, a conventional technique factorizes it into a multiple-target weighted prediction error (WPE) based dereverberation filter and a separation matrix. However, this technique requires the calculation of a huge spatio-temporal covariance matrix that reflects the statistics of all the sources, which makes the computational cost very high. For computationally efficient optimization, this paper introduces two techniques: one that decomposes the huge covariance matrix into ones for individual sources, and another that decomposes the CBF into sub-filters for estimating individual sources. Both techniques effectively and substantively reduce the size of the covariance matrices that must calculated, and allow us to greatly reduce the computational cost without loss of optimality.

引用

页码：91 / 95

页数：5

共 50 条

[31] A COMPUTATIONALLY CHEAPER METHOD FOR BLIND SPEECH SEPARATION BASED ON AUXIVA AND INCOMPLETE DEMIXING TRANSFORM
Jansky, Jakub
Koldovsky, Zbynek
Ono, Nobutaka
2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
[32] A Novel Approach for Blind Separation and Dereverberation of Speech Mixtures using Multiple step Linear Predictive Coding
Ehsan, Wajeeha
Jan, Tariqullah
2015 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET), 2015,
[33] An Efficient Analytic Solution for Joint Blind Source Separation
Gabrielson, Ben
Baker Siddique Akhonda, Mohammad Abu
Lehmann, Isabell
Adali, Tulay
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 2436 - 2449
[34] Blind Separation of the Joint Algorithm Based on the Cyclostationarity Optimization
Jing-hong, Xue
Min, Li
2016 IEEE INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION AND COMMUNICATION TECHNOLOGY ICEICT 2016 PROCEEDINGS, 2016, : 266 - 270
[35] A convolutive blind signal separation method for joint speech signal separation and echo cancellation
Pan, QF
Aboulnasr, T
Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 752 - 755
[36] A stochastic framework for computationally efficient fail-safe topology optimization
Zhang, Yiming
Zhang, Hongyi
Qiu, Lemiao
Wang, Zili
Zhang, Shuyou
Qiu, Na
Fang, Jianguang
ENGINEERING STRUCTURES, 2023, 283
[37] Computationally-Efficient Overdetermined Blind Source Separation Based on Iterative Source Steering
Du, Yicheng
Scheibler, Robin
Togami, Masahito
Yoshii, Kazuyoshi
Kawahara, Tatsuya
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 927 - 931
[38] Computationally Efficient Blind Source Separation-Based MIMO-PA Linearization
Shipra
Rawat, Meenakshi
IEEE ACCESS, 2024, 12 : 3126 - 3139
[39] Joint dereverberation and blind source separation using a hybrid autoregressive and convolutive transfer function-based model
Liu, Shengdong
Yang, Feiran
Chen, Rilin
Yang, Jun
APPLIED ACOUSTICS, 2024, 224
[40] BLIND SPEECH SEPARATION EMPLOYING DIRECTIONAL STATISTICS IN AN EXPECTATION MAXIMIZATION FRAMEWORK
Dang Hai Tran Vu
Haeb-Umbach, Reinhold
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 241 - 244

← 1 2 3 4 5 →