Reverberant Speech Recognition Based on Denoising Autoencoder

被引:0
作者
Ishii, Takaaki [1 ]
Komiyama, Hiroki [1 ]
Shinozaki, Takahiro [2 ]
Horiuchi, Yasuo [1 ]
Kuroiwa, Shingo [1 ]
机构
[1] Chiba Univ, Grad Sch Adv Integrat Sci, Div Informat Sci, Chiba, Japan
[2] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Dept Informat Proc, Tokyo, Japan
来源
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年
关键词
Denoising autoencoder; reverberant speech recognition; restricted Boltzmann machine; distant-talking speech recognition; CENSREC-4; REPRESENTATIONS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Denoising autoencoder is applied to reverberant speech recognition as a noise robust front-end to reconstruct clean speech spectrum from noisy input. In order to capture context effects of speech sounds, a window of multiple short-windowed spectral frames are concatenated to form a single input vector. Additionally, a combination of short and long-term spectra is investigated to properly handle long impulse response of reverberation while keeping necessary time resolution for speech recognition. Experiments are performed using the CENSREC-4 dataset that is designed as an evaluation framework for distant-talking speech recognition. Experimental results show that the proposed denoising autoencoder based front-end using the short-windowed spectra gives better results than conventional methods. By combining the long-term spectra, further improvement is obtained. The recognition accuracy by the proposed method using the short and long-term spectra is 97.0% for the open condition test set of the dataset, whereas it is 87.8% when a multi condition training based baseline is used. As a supplemental experiment, large vocabulary speech recognition is also performed and the effectiveness of the proposed method has been confirmed.
引用
收藏
页码:3479 / 3483
页数:5
相关论文
共 50 条
  • [41] Auxiliary Stacked Denoising Autoencoder based Collaborative Filtering Recommendation
    Mu, Ruihui
    Zeng, Xiaoqin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (06): : 2310 - 2332
  • [42] Domain adaptation network based on hypergraph regularized denoising autoencoder
    Wang, Xuesong
    Ma, Yuting
    Cheng, Yuhu
    ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (03) : 2061 - 2079
  • [43] A blind denoising method of vibration signals based on a deep autoencoder
    Wan R.
    Zhang C.
    Jiang H.
    Li Y.
    Zhendong yu Chongji/Journal of Vibration and Shock, 2023, 42 (12): : 118 - 125
  • [44] Nonlinear process monitoring based on load weighted denoising autoencoder
    Zhu, Jiazhen
    Shi, Hongbo
    Song, Bing
    Tao, Yang
    Tan, Shuai
    Zhang, Tianqing
    MEASUREMENT, 2021, 171
  • [45] MULTI-TASK AUTOENCODER FOR NOISE-ROBUST SPEECH RECOGNITION
    Zhang, Haoyi
    Liu, Conggui
    Inoue, Nakamasa
    Shinoda, Koichi
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5599 - 5603
  • [46] Radar Signal Intra-Pulse Modulation Recognition Based on Convolutional Denoising Autoencoder and Deep Convolutional Neural Network
    Qu, Zhiyu
    Wang, Wenyang
    Hou, Changbo
    Hou, Chenfan
    IEEE ACCESS, 2019, 7 : 112339 - 112347
  • [47] ON DNN POSTERIOR PROBABILITY COMBINATION IN MULTI-STREAM SPEECH RECOGNITION FOR REVERBERANT ENVIRONMENTS
    Xiong, Feifei
    Goetze, Stefan
    Meyer, Bernd T.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5250 - 5254
  • [48] A transfer learning framework for well placement optimization based on denoising autoencoder
    Qi, Ji
    Liu, Yanqing
    Ju, Yafeng
    Zhang, Kai
    Liu, Lu
    Liu, Yuanyuan
    Xue, Xiaoming
    Zhang, Liming
    Zhang, Huaqing
    Wang, Haochen
    Yao, Jun
    Zhang, Weidong
    GEOENERGY SCIENCE AND ENGINEERING, 2023, 222
  • [49] DEVDAN: Deep evolving denoising autoencoder
    Ashfahani, Andri
    Pratama, Mahardhika
    Lughofer, Edwin
    Ong, Yew Soon
    NEUROCOMPUTING, 2020, 390 : 297 - 314
  • [50] Denoising Autoencoder-Based Missing Value Imputation for Smart Meters
    Ryu, Seunghyoung
    Kim, Minsoo
    Kim, Hongseok
    IEEE ACCESS, 2020, 8 : 40656 - 40666