Multi-task Learning Deep Neural Networks For Speech Feature Denoising

被引:0
|
作者
Huang, Bin [1 ]
Ke, Dengfeng [2 ]
Zheng, Hao [2 ]
Xu, Bo [2 ]
Xu, Yanyan [1 ]
Su, Kaile [3 ]
机构
[1] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[3] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Qld, Australia
关键词
multi-task learning; feature denoising; deep neural networks; ENHANCEMENT;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Traditional automatic speech recognition (ASR) systems usually get a sharp performance drop when noise presents in speech. To make a robust ASR, we introduce a new model using the multi-task learning deep neural networks (MTL-DNN) to solve the speech denoising task in feature level. In this model, the networks are initialized by pre-training restricted Boltzmann machines (RBM) and fine-tuned by jointly learning multiple interactive tasks using a shared representation. In multi-task learning, we choose a noisy-clean speech pair fitting task as the primary task and separately explore two constraints as the secondary tasks: phone label and phone cluster. In experiments, the denoised speech is reconstructed by the MTL-DNN using the noisy speech as input and it is respectively evaluated by the DNN-hidden Markov model (HMM) based and the Gaussian Mixture Model (GMM)-HMM based ASR systems. Results show that, using the denoised speech, the word error rate (WER) is respectively reduced by 53.14% and 34.84% compared with baselines. The MTL-DNN model also outperforms the general single-task learning deep neural networks (STL-DNN) model with a performance improvement of 4.93% and 3.88% respectively.
引用
收藏
页码:2464 / 2468
页数:5
相关论文
共 50 条
  • [41] Multi-Task Learning Based on Stochastic Configuration Neural Networks
    Dong, Xue-Mei
    Kong, Xudong
    Zhang, Xiaoping
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [42] Multi-Task Reinforcement Meta-Learning in Neural Networks
    Shakah, Ghazi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 263 - 269
  • [43] Learning Task Relational Structure for Multi-Task Feature Learning
    Wang, De
    Nie, Feiping
    Huang, Heng
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1239 - 1244
  • [44] Utilization of age information for speaker verification using multi-task learning deep neural networks
    Kim, Ju-ho
    Heo, Hee-Soo
    Jung, Jee-weon
    Shim, Hye-jin
    Kim, Seung-Bin
    Yu, Ha-Jin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (05): : 593 - 600
  • [45] Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection
    Li, Jiakang
    Sun, Meng
    Zhang, Xiongwei
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1517 - 1522
  • [46] Multi-Task Deep Neural Networks for Multi-Document Reading Comprehension
    Liu, Chang
    Liu, Zhuang
    Lin, Wayne
    Zhao, Jun
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [47] Predicting human protein function with multi-task deep neural networks
    Fa, Rui
    Cozzetto, Domenico
    Wan, Cen
    Jones, David T.
    PLOS ONE, 2018, 13 (06):
  • [48] GemNN: Gating-Enhanced Multi-Task Neural Networks with Feature Interaction Learning for CTR Prediction
    Fei, Hongliang
    Zhang, Jingyuan
    Zhou, Xingxuan
    Zhao, Junhao
    Qi, Xinyang
    Li, Ping
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2166 - 2171
  • [49] Multi-Task Deep Neural Networks for Multimodal Personality Trait Prediction
    Mujtaba, Dena F.
    Mahapatra, Nihar R.
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 85 - 91
  • [50] Pareto Multi-task Deep Learning
    Riccio, Salvatore D.
    Dyankov, Deyan
    Jansen, Giorgio
    Di Fatta, Giuseppe
    Nicosia, Giuseppe
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 132 - 141