Speech Recognition Using Blind Source Separation and Dereverberation Method for Mixed Sound of Speech and Music

被引:0
|
作者
Wang, Longbiao [1 ]
Odani, Kyohei [2 ]
Kai, Atsuhiko [2 ]
Li, Weifeng [3 ]
机构
[1] Nagaoka Univ Technol, Nagaoka, Niigata 9402188, Japan
[2] Shizuoka Univ, Grad Sch Engn, Hamamatsu, Shizuoka 4328561, Japan
[3] Tsinghua Univ, Shenzhen 100084, Peoples R China
来源
2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2013年
关键词
hands-free speech recognition; blind dereverberation; blind source separation; multi-channel least mean square; generalized spectral subtraction; INDEPENDENT COMPONENT ANALYSIS; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a method for performing a non-stationary noise reduction and dereverberation method. We use a blind dereverberation method based on spectral subtraction using a multi-channel least mean square algorithm has been proposed in our previous study. To suppress the non-stationary noise, we used a blind source separation based on an efficient fast independent component analysis algorithm. This method is evaluated using a mixed sound of speech and music, and achieves an average relative word error reduction rate of 41.9% and 7.9% compared with a baseline method and the state-of-the-art multi-step linear prediction-based dereverberation, respectively, in a real environment.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] The Influence of Blind Source Separation on Mixed Audio Speech and Music Emotion Recognition
    Laugs, Casper
    Koops, Hendrik Vincent
    Odijk, Daan
    Kaya, Heysem
    Volk, Anja
    COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION), 2020, : 67 - 71
  • [2] A Semi-blind Source Separation Approach for Speech Dereverberation
    Wang, Ziteng
    Na, Yueyue
    Liu, Zhang
    Li, Yun
    Tian, Biao
    Fu, Qiang
    INTERSPEECH 2020, 2020, : 3925 - 3929
  • [3] Blind Speech Separation and Dereverberation using neural beamforming
    Pfeifenberger, Lukas
    Pernkopf, Franz
    SPEECH COMMUNICATION, 2022, 140 : 29 - 41
  • [4] JOINT BLIND DEREVERBERATION AND SEPARATION OF SPEECH MIXTURES
    Jan, Tariqullah
    Wang, Wenwu
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2343 - 2347
  • [5] Joint Blind Source Separation and Dereverberation for Automatic Speech Recognition using Delayed-Subsource MNMF with Localization Prior
    Fras, Mieszko
    Witkowski, Marcin
    Kowalczyk, Konrad
    INTERSPEECH 2023, 2023, : 3734 - 3738
  • [6] Online blind source separation and dereverberation of speech based on a joint diagonalizability constraint
    Yu, Ho-Gun
    Kim, Do-Hui
    Song, Min-Hwan
    Park, Hyung-Min
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 503 - 514
  • [7] Blind Source Separation of Noisy Mixed Speech Signals
    Li, Huiya
    Shi, Jianying
    Men, Jinxi
    SENSORS, MEASUREMENT AND INTELLIGENT MATERIALS II, PTS 1 AND 2, 2014, 475-476 : 291 - +
  • [8] Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization
    Yoshioka, Takuya
    Nakatani, Tomohiro
    Miyoshi, Masato
    Okuno, Hiroshi G.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 69 - 84
  • [9] Bayesian Integration of Sound Source Separation and Speech Recognition: A New Approach to Simultaneous Speech Recognition
    Itakura, Kousuke
    Nishimuta, Izaya
    Bando, Yoshiaki
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 736 - 740
  • [10] Over-determined Speech Source Separation and Dereverberation
    Togami, Masahito
    Scheibler, Robin
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 705 - 710