INVESTIGATION OF DEEP BOLTZMANN MACHINES FOR PHONE RECOGNITION

被引:0
|
作者
You, Zhao [1 ]
Wang, Xiaorui [1 ]
Xu, Bo [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Interact Digital Media Technol Res Ctr, Beijing, Peoples R China
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
phone recognition; acoustic modeling; Deep Boltzmann Machines; Deep Neural Networks;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the past few years, deep neural networks (DNNs) achieved great successes in speech recognition. The layer-wise pre-trained deep belief network (DBN) is known as one of the critical factor to optimize the DNN. However, the DBN has one shortcoming that the pre-training procedure is in a greedy forward pass. The top-down influences on the inference process are ignored, thus the pre-trained DBN is suboptimal. In this paper, we attempt to apply deep Boltzmann machine (DBM) on acoustic modeling. DBM has the advantages that a top-down feedback is incorporated and the parameters of all layers can be jointly optimized. Experiments are conducted on the TIMIT phone recognition task to investigate the DBM-DNN acoustic model. Comparing with the DBN-DNN with same amount of parameters, phone error rate on the core test set is reduced by 3.8% relatively, and additional 5.1% by dropout fine-tuning.
引用
收藏
页码:7600 / 7603
页数:4
相关论文
共 50 条
  • [11] Deep-FS: A feature selection algorithm for Deep Boltzmann Machines
    Taherkhani, Aboozar
    Cosma, Georgina
    McGinnity, T. M.
    NEUROCOMPUTING, 2018, 322 : 22 - 37
  • [12] Convolutional Deep Maxout Networks for Phone Recognition
    Toth, Laszlo
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1078 - 1082
  • [13] Rolling Bearing Fault Diagnosis based on Deep Boltzmann Machines
    Deng, Shengcai
    Cheng, Zhiwei
    Li, Chuan
    Yao, Xingyan
    Chen, Zhiqiang
    Sanchez, Rene-Vinicio
    2016 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-CHENGDU), 2016,
  • [14] Fault Diagnosis Method Based on Improved Deep Boltzmann Machines
    Liu, Dan
    Wang, Qin
    Tao, Jiaojiao
    Li, Guang
    Wu, Jie
    PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 458 - 462
  • [15] Text-mining the NeuroSynth corpus using Deep Boltzmann Machines
    Monti, Ricardo
    Lorenz, Romy
    Leech, Robert
    Anagnostopoulos, Christoforos
    Montana, Giovanni
    2016 6TH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION IN NEUROIMAGING (PRNI), 2016, : 13 - 16
  • [16] RESOURCE CONFIGURABLE SPOKEN QUERY DETECTION USING DEEP BOLTZMANN MACHINES
    Zhang, Yaodong
    Salakhutdinov, Ruslan
    Chang, Hung-An
    Glass, James
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5161 - 5164
  • [17] Posed and Spontaneous Facial Expression Differentiation Using Deep Boltzmann Machines
    Gan, Quan
    Wu, Chongliang
    Wang, Shangfei
    Ji, Qiang
    2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 643 - 648
  • [18] DEEP BELIEF NETWORKS USING DISCRIMINATIVE FEATURES FOR PHONE RECOGNITION
    Mohamed, Abdel-rahman
    Sainath, Tara N.
    Dahl, George
    Ramabhadran, Bhuvana
    Hinton, Geoffrey E.
    Picheny, Michael A.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5060 - 5063
  • [19] Fine Tuning Deep Boltzmann Machines Through Meta-Heuristic Approaches
    Passos, Leandro A.
    Rodrigues, Douglas R.
    Papa, Joao P.
    2018 IEEE 12TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS (SACI), 2018, : 419 - 424
  • [20] A Joint Deep Boltzmann Machine (jDBM) Model for Person Identification Using Mobile Phone Data
    Alam, Mohammad Rafiqul
    Bennamoun, Mohammed
    Togneri, Roberto
    Sohel, Ferdous
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (02) : 317 - 326