Speech Recognition Using Blind Source Separation and Dereverberation Method for Mixed Sound of Speech and Music

Cited by: 0

Authors:

Wang, Longbiao [1]

Odani, Kyohei [2]

Kai, Atsuhiko [2]

Li, Weifeng [3]

Affiliations:

[1] Nagaoka Univ Technol, Nagaoka, Niigata 9402188, Japan

[2] Shizuoka Univ, Grad Sch Engn, Hamamatsu, Shizuoka 4328561, Japan

[3] Tsinghua Univ, Shenzhen 100084, Peoples R China

Source:

2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2013

Keywords:

hands-free speech recognition; blind dereverberation; blind source separation; multi-channel least mean square; generalized spectral subtraction; INDEPENDENT COMPONENT ANALYSIS; ALGORITHM;

DOI:

Not available

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

In this paper, we propose a method for non-stationary noise reduction and dereverberation. For dereverberation, we use a blind method based on spectral subtraction using a multi-channel least mean square algorithm, which was proposed in our previous study. To suppress the non-stationary noise, we use blind source separation based on an efficient fast independent component analysis algorithm. The method is evaluated using a mixed sound of speech and music, and achieves average relative word error reduction rates of 41.9% and 7.9% compared with a baseline method and the state-of-the-art multi-step linear prediction-based dereverberation, respectively, in a real environment.
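The abstract reports its results as relative word error reduction rates. As a minimal sketch, assuming the conventional definition (the drop in word error rate divided by the baseline word error rate), the metric could be computed as below. The function name and the example error rates are hypothetical and are not taken from the paper.

```python
def relative_error_reduction(baseline_wer: float, proposed_wer: float) -> float:
    """Relative word error reduction (percent) of a proposed system over a baseline.

    Assumes the conventional definition: (baseline - proposed) / baseline.
    """
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - proposed_wer) / baseline_wer


# Hypothetical word error rates (percent), chosen only for illustration.
print(relative_error_reduction(50.0, 30.0))  # 40.0
```

Under this definition, a relative reduction of 41.9% means the proposed system removes roughly four in ten of the baseline's word errors; it is not an absolute drop of 41.9 percentage points.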


Pages: 4

50 records in total

[21] Speech dereverberation and source separation using DNN-WPE and LWPR-PCA
Sheeja, Jasmine J. C.
Sankaragomathi, B.
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (10): 7339-7356
[23] Improvement of Speech Recognition for Robots Using Blind Signal Separation
Bicher, Daniel
Kroll-Peters, Olaf
Lee, Thebin
Tiotuico, Natascha
Wilhelm, Mathias
ISCGAV'08: PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTATIONAL GEOMETRY AND ARTIFICIAL VISION, 2008: 52-55
[25] Using blind source separation techniques to improve speech recognition in bilateral cochlear implant patients
Kokkinakis, Kostas
Loizou, Philipos C.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04): 2379-2390
[26] Speech recognition in mixed sound of speech and music based on vector quantization and non-negative matrix factorization
Nakano, Shoichi
Yamamoto, Kazumasa
Nakagawa, Seiichi
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011: 1792-1795
[27] Recognition of pure music from speech sound-music mixed part of audio signal
Kong, Ling-Zhi
Luo, Sen-Lin
Zhang, Bing
Wang, Yao-Wei
Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2009, 29 (01): 63-67
[28] Towards speech recognition oriented dereverberation
Jinachitra, P
Prieto, RE
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005: 437-440
[29] Sound Source Separation for Plural Passenger Speech Recognition in Smart Mobility System
Fukui, Masahiro
Watanabe, Toshihiko
Kanazawa, Minato
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2018, 64 (03): 399-405
[30] SPEECH DEREVERBERATION USING A LEARNED SPEECH MODEL
Liang, Dawen
Hoffman, Matthew D.
Mysore, Gautham J.
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015: 1871-1875
