Hidden Conditional Random Fields for Phone Recognition

Cited by: 20
Authors
Sung, Yun-Hsuan [1]
Jurafsky, Dan [1]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
Source
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009) | 2009
DOI
10.1109/ASRU.2009.5373329
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We apply Hidden Conditional Random Fields (HCRFs) to the task of TIMIT phone recognition. HCRFs are discriminatively trained sequence models that augment conditional random fields with hidden states that are capable of representing subphones and mixture components. We extend HCRFs, which had previously only been applied to phone classification with known boundaries, to recognize continuous phone sequences. We use an N-best inference algorithm in both learning (to approximate all competitor phone sequences) and decoding (to marginalize over hidden states). Our monophone HCRFs achieve 28.3% phone error rate, outperforming maximum likelihood trained HMMs by 3.6%, maximum mutual information trained HMMs by 2.5%, and minimum phone error trained HMMs by 2.2%. We show that this win is partially due to HCRFs' ability to simultaneously optimize discriminative language models and acoustic models, a powerful property that has important implications for speech recognition.
Pages: 107-112
Page count: 6
Related papers
50 items in total
  • [21] Hidden Conditional Random Field with Distribution Constraints for Phone Classification
    Yu, Dong
    Deng, Li
    Acero, Alex
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 668 - 671
  • [22] Minimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition
    Hong, Wei-Tyng
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (04) : 729 - 742
  • [23] Learning Partially-Observed Hidden Conditional Random Fields for Facial Expression Recognition
    Chang, Kai-Yueh
    Liu, Tyng-Luh
    Lai, Shang-Hong
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 533 - +
  • [24] Dynamic Perceptual Attribute-Based Hidden Conditional Random Fields for Gesture Recognition
    Hu, Gang
    Gao, Qigang
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2015), 2015, 9164 : 259 - 268
  • [25] Regularization, adaptation, and non-independent features improve Hidden Conditional Random Fields for phone classification
    Sung, Yun-Hsuan
    Boulis, Constantinos
    Manning, Christopher
    Jurafsky, Dan
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 347 - 352
  • [26] Variational Hidden Conditional Random Fields with Beta Processes
    Luo, Chen
    Sun, Shiliang
    Zhao, Jing
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017,
  • [27] Hidden Conditional Ordinal Random Fields for Sequence Classification
    Kim, Minyoung
    Pavlovic, Vladimir
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II: EUROPEAN CONFERENCE, ECML PKDD 2010, 2010, 6322 : 51 - 65
  • [28] Hyperparameter tuning for hidden unit conditional random fields
    Yang, Eun-Suk
    Kim, Jong Dae
    Park, Chan-Young
    Song, Hye-Jeong
    Kim, Yu-Seop
    ENGINEERING COMPUTATIONS, 2017, 34 (06) : 2054 - 2062
  • [29] Surface Electromyography and Acceleration Based Sign Language Recognition Using Hidden Conditional Random Fields
    Ma, Deen
    Chen, Xiang
    Li, Yun
    Cheng, Juan
    Ma, Yuncong
    2012 IEEE EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES), 2012,
  • [30] Gaussian Conditional Random Fields for Face Recognition
    Smereka, Jonathon M.
    Kumar, B. V. K. Vijaya
    Rodriguez, Andres
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 155 - 162