INVESTIGATION OF DEEP BOLTZMANN MACHINES FOR PHONE RECOGNITION

被引:0
|
作者
You, Zhao [1 ]
Wang, Xiaorui [1 ]
Xu, Bo [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Interact Digital Media Technol Res Ctr, Beijing, Peoples R China
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
phone recognition; acoustic modeling; Deep Boltzmann Machines; Deep Neural Networks;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the past few years, deep neural networks (DNNs) achieved great successes in speech recognition. The layer-wise pre-trained deep belief network (DBN) is known as one of the critical factor to optimize the DNN. However, the DBN has one shortcoming that the pre-training procedure is in a greedy forward pass. The top-down influences on the inference process are ignored, thus the pre-trained DBN is suboptimal. In this paper, we attempt to apply deep Boltzmann machine (DBM) on acoustic modeling. DBM has the advantages that a top-down feedback is incorporated and the parameters of all layers can be jointly optimized. Experiments are conducted on the TIMIT phone recognition task to investigate the DBM-DNN acoustic model. Comparing with the DBN-DNN with same amount of parameters, phone error rate on the core test set is reduced by 3.8% relatively, and additional 5.1% by dropout fine-tuning.
引用
收藏
页码:7600 / 7603
页数:4
相关论文
共 50 条
  • [1] Gender Aware Deep Boltzmann Machines for Phone Recognition
    Zoughi, Toktam
    Homayounpour, Mohammad Mehdi
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [2] PHONE RECOGNITION USING RESTRICTED BOLTZMANN MACHINES
    Mohamed, Abdel-rahman
    Hinton, Geoffrey
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4354 - 4357
  • [3] Deep Boltzmann Machines based Vehicle Recognition
    Hu, Aiqin
    Li, Hong
    Zhang, Fan
    Zhang, Wei
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 3033 - 3038
  • [4] Compression by and for Deep Boltzmann Machines
    Li, Qing
    Chen, Yang
    Kim, Yongjune
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (12) : 7498 - 7510
  • [5] A 3D model recognition mechanism based on deep Boltzmann machines
    Leng, Biao
    Zhang, Xiangyang
    Yao, Ming
    Xiong, Zhang
    NEUROCOMPUTING, 2015, 151 : 593 - 602
  • [6] Acoustic Emotion Recognition based on Fusion of Multiple Feature-Dependent Deep Boltzmann Machines
    Poon-Feng, Kelvin
    Huang, Dong-Yan
    Dong, Minghui
    Li, Haizhou
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 584 - +
  • [7] Temperature-Based Deep Boltzmann Machines
    Passos, Leandro Aparecido, Jr.
    Papa, Joao Paulo
    NEURAL PROCESSING LETTERS, 2018, 48 (01) : 95 - 107
  • [8] PHONE RECOGNITION WITH DEEP SPARSE RECTIFIER NEURAL NETWORKS
    Toth, Laszlo
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6985 - 6989
  • [9] Convolutional Deep Rectifier Neural Nets for Phone Recognition
    Toth, Laszlo
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1721 - 1725
  • [10] Temperature-Based Deep Boltzmann Machines
    Leandro Aparecido Passos
    João Paulo Papa
    Neural Processing Letters, 2018, 48 : 95 - 107