Speech Recognition Using Blind Source Separation and Dereverberation Method for Mixed Sound of Speech and Music

被引:0
|
作者
Wang, Longbiao [1 ]
Odani, Kyohei [2 ]
Kai, Atsuhiko [2 ]
Li, Weifeng [3 ]
机构
[1] Nagaoka Univ Technol, Nagaoka, Niigata 9402188, Japan
[2] Shizuoka Univ, Grad Sch Engn, Hamamatsu, Shizuoka 4328561, Japan
[3] Tsinghua Univ, Shenzhen 100084, Peoples R China
来源
2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2013年
关键词
hands-free speech recognition; blind dereverberation; blind source separation; multi-channel least mean square; generalized spectral subtraction; INDEPENDENT COMPONENT ANALYSIS; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a method for performing a non-stationary noise reduction and dereverberation method. We use a blind dereverberation method based on spectral subtraction using a multi-channel least mean square algorithm has been proposed in our previous study. To suppress the non-stationary noise, we used a blind source separation based on an efficient fast independent component analysis algorithm. This method is evaluated using a mixed sound of speech and music, and achieves an average relative word error reduction rate of 41.9% and 7.9% compared with a baseline method and the state-of-the-art multi-step linear prediction-based dereverberation, respectively, in a real environment.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Speech dereverberation and source separation using DNN-WPE and LWPR-PCA
    Sheeja, Jasmine J. C.
    Sankaragomathi, B.
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (10): : 7339 - 7356
  • [22] Speech dereverberation and source separation using DNN-WPE and LWPR-PCA
    Jasmine J. C. Sheeja
    B. Sankaragomathi
    Neural Computing and Applications, 2023, 35 : 7339 - 7356
  • [23] Improvement of Speech Recognition for Robots Using Blind Signal Separation
    Bicher, Daniel
    Kroll-Peters, Olaf
    Lee, Thebin
    Tiotuico, Natascha
    Wilhelm, Mathias
    ISCGAV'08: PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTATIONAL GEOMETRY AND ARTIFICIAL VISION, 2008, : 52 - 55
  • [24] Using blind source separation techniques to improve speech recognition in bilateral cochlear implant patients
    Kokkinakis, Kostas
    Loizou, Philipos C.
    1600, Acoustical Society of America, 2 Huntington Quadrangle, Ste 1NO1, Melville, NY 11747-4502, United States (123):
  • [25] Using blind source separation techniques to improve speech recognition in bilateral cochlear implant patients
    Kokkinakis, Kostas
    Loizou, Philipos C.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04): : 2379 - 2390
  • [26] Speech recognition in mixed sound of speech and music based on vector quantization and non-negative matrix factorization
    Nakano, Shoichi
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1792 - 1795
  • [27] Recognition of pure music from speech sound-music mixed part of audio signal
    Kong, Ling-Zhi
    Luo, Sen-Lin
    Zhang, Bing
    Wang, Yao-Wei
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2009, 29 (01): : 63 - 67
  • [28] Towards speech recognition oriented dereverberation
    Jinachitra, P
    Prieto, RE
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 437 - 440
  • [29] Sound Source Separation for Plural Passenger Speech Recognition in Smart Mobility System
    Fukui, Masahiro
    Watanabe, Toshihiko
    Kanazawa, Minato
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2018, 64 (03) : 399 - 405
  • [30] SPEECH DEREVERBERATION USING A LEARNED SPEECH MODEL
    Liang, Dawen
    Hoffman, Matthew D.
    Mysore, Gautham J.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1871 - 1875