Speech Recognition Using Blind Source Separation and Dereverberation Method for Mixed Sound of Speech and Music

Cited by: 0

Authors:

Wang, Longbiao [1]

Odani, Kyohei [2]

Kai, Atsuhiko [2]

Li, Weifeng [3]

Affiliations:

[1] Nagaoka Univ Technol, Nagaoka, Niigata 9402188, Japan

[2] Shizuoka Univ, Grad Sch Engn, Hamamatsu, Shizuoka 4328561, Japan

[3] Tsinghua Univ, Shenzhen 100084, Peoples R China

Source:

2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2013

Keywords:

hands-free speech recognition; blind dereverberation; blind source separation; multi-channel least mean square; generalized spectral subtraction; independent component analysis; algorithm

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]

Discipline classification code:

0808; 0809

Abstract:

In this paper, we propose a method that performs both non-stationary noise reduction and dereverberation. For dereverberation, we use a blind method based on spectral subtraction with a multi-channel least mean square algorithm, which was proposed in our previous study. To suppress the non-stationary noise, we use blind source separation based on an efficient fast independent component analysis algorithm. The method is evaluated on a mixed sound of speech and music in a real environment, and achieves average relative word error reduction rates of 41.9% and 7.9% compared with a baseline method and the state-of-the-art multi-step linear prediction-based dereverberation, respectively.


Pages: 4


[1] The Influence of Blind Source Separation on Mixed Audio Speech and Music Emotion Recognition
Laugs, Casper
Koops, Hendrik Vincent
Odijk, Daan
Kaya, Heysem
Volk, Anja
COMPANION PUBLICATION OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION), 2020: 67-71
[2] A Semi-blind Source Separation Approach for Speech Dereverberation
Wang, Ziteng
Na, Yueyue
Liu, Zhang
Li, Yun
Tian, Biao
Fu, Qiang
INTERSPEECH 2020, 2020: 3925-3929
[3] Blind Speech Separation and Dereverberation using neural beamforming
Pfeifenberger, Lukas
Pernkopf, Franz
SPEECH COMMUNICATION, 2022, 140: 29-41
[4] JOINT BLIND DEREVERBERATION AND SEPARATION OF SPEECH MIXTURES
Jan, Tariqullah
Wang, Wenwu
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012: 2343-2347
[5] Joint Blind Source Separation and Dereverberation for Automatic Speech Recognition using Delayed-Subsource MNMF with Localization Prior
Fras, Mieszko
Witkowski, Marcin
Kowalczyk, Konrad
INTERSPEECH 2023, 2023: 3734-3738
[6] Online blind source separation and dereverberation of speech based on a joint diagonalizability constraint
Yu, Ho-Gun
Kim, Do-Hui
Song, Min-Hwan
Park, Hyung-Min
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): 503-514
[7] Blind Source Separation of Noisy Mixed Speech Signals
Li, Huiya
Shi, Jianying
Men, Jinxi
SENSORS, MEASUREMENT AND INTELLIGENT MATERIALS II, PTS 1 AND 2, 2014, 475-476: 291-+
[8] Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization
Yoshioka, Takuya
Nakatani, Tomohiro
Miyoshi, Masato
Okuno, Hiroshi G.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): 69-84
[9] Bayesian Integration of Sound Source Separation and Speech Recognition: A New Approach to Simultaneous Speech Recognition
Itakura, Kousuke
Nishimuta, Izaya
Bando, Yoshiaki
Itoyama, Katsutoshi
Yoshii, Kazuyoshi
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015: 736-740
[10] Over-determined Speech Source Separation and Dereverberation
Togami, Masahito
Scheibler, Robin
2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020: 705-710
