Hidden Conditional Random Fields for Phone Recognition

Cited by: 20
Authors
Sung, Yun-Hsuan [1]
Jurafsky, Dan [1]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
Source
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009) | 2009
DOI
10.1109/ASRU.2009.5373329
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We apply Hidden Conditional Random Fields (HCRFs) to the task of TIMIT phone recognition. HCRFs are discriminatively trained sequence models that augment conditional random fields with hidden states that are capable of representing subphones and mixture components. We extend HCRFs, which had previously only been applied to phone classification with known boundaries, to recognize continuous phone sequences. We use an N-best inference algorithm in both learning (to approximate all competitor phone sequences) and decoding (to marginalize over hidden states). Our monophone HCRFs achieve 28.3% phone error rate, outperforming maximum likelihood trained HMMs by 3.6%, maximum mutual information trained HMMs by 2.5%, and minimum phone error trained HMMs by 2.2%. We show that this win is partially due to HCRFs' ability to simultaneously optimize discriminative language models and acoustic models, a powerful property that has important implications for speech recognition.
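The decoding step mentioned in the abstract (marginalizing over hidden states with an N-best list) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy phone labels, and the scores below are all hypothetical, and the scores stand in for unnormalized log-scores of individual hidden-state paths. The sketch groups N-best hidden paths by the phone sequence they realize, sums their scores in the log domain, and picks the phone sequence with the largest total.

```python
import math
from collections import defaultdict


def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def marginalize_nbest(nbest):
    """Approximate marginalization over hidden states using an N-best list.

    nbest: list of (phone_sequence, log_score) pairs, one per hidden-state
    path. Several paths may share the same phone sequence.
    Returns the best phone sequence and the approximate posteriors,
    normalized over the N-best list only.
    """
    grouped = defaultdict(list)
    for phones, log_score in nbest:
        grouped[tuple(phones)].append(log_score)
    # Sum over hidden paths that realize the same phone sequence.
    totals = {p: logsumexp(scores) for p, scores in grouped.items()}
    log_z = logsumexp(list(totals.values()))
    posteriors = {p: math.exp(t - log_z) for p, t in totals.items()}
    best = max(posteriors, key=posteriors.get)
    return best, posteriors


# Hypothetical N-best list: (phone sequence, log-score of one hidden path).
nbest = [
    (("s", "ih", "t"), 2.0),
    (("s", "eh", "t"), 1.6),
    (("s", "eh", "t"), 1.5),
    (("s", "ih", "t"), 0.1),
]
best, posteriors = marginalize_nbest(nbest)
top_single_path = max(nbest, key=lambda item: item[1])[0]
```

In this toy list the single highest-scoring hidden path belongs to one phone sequence, while the summed score favors the other, which is the situation where marginalizing differs from picking the best single path.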
Pages: 107-112
Page count: 6
Related papers
50 in total
  • [1] Hidden Conditional Random Fields for Face Recognition
    Yang, Huachun
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2012), 2013, 8768
  • [2] Hidden Conditional Random Fields for Face Recognition
    Yang, Huachun
    2013 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND APPLICATIONS (CSA), 2013, : 337 - 340
  • [3] Hidden Conditional Random Fields for Gait Recognition
    Hagui, Mabrouka
    Mahjoub, Mohamed Ali
    2016 SECOND INTERNATIONAL IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2016
  • [4] Hidden Conditional Random Fields for Action Recognition
    Chen, Lifang
    van der Aa, Nico
    Tan, Robby T.
    Veltkamp, Remco C.
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 240 - 247
  • [5] Efficient Segmental Conditional Random Fields for Phone Recognition
    He, Yanzhang
    Fosler-Lussier, Eric
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1896 - 1899
  • [6] Hidden Conditional Random Fields for Visual Speech Recognition
    Pass, Adrian
    Zhang, Jianguo
    Stewart, Darryl
    2009 13TH INTERNATIONAL MACHINE VISION AND IMAGE PROCESSING CONFERENCE, 2009, : 117 - 122
  • [7] Hand Posture Recognition Using Hidden Conditional Random Fields
    Liu, Te-Cheng
    Wang, Ko-Chih
    Tsai, Augustine
    Wang, Chieh-Chih
    2009 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, VOLS 1-3, 2009, : 1817 - +
  • [8] Hidden conditional random fields
    Quattoni, Ariadna
    Wang, Sybor
    Morency, Louis-Philippe
    Collins, Michael
    Darrell, Trevor
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (10) : 1848 - 1853
  • [9] Deep-Structured Hidden Conditional Random Fields for Phonetic Recognition
    Yu, Dong
    Deng, Li
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2986 - 2989
  • [10] Robust Incremental Hidden Conditional Random Fields for Human Action Recognition
    Vrigkas, Michalis
    Mastora, Ermioni
    Nikou, Christophoros
    Kakadiaris, Ioannis A.
    ADVANCES IN VISUAL COMPUTING, ISVC 2018, 2018, 11241 : 126 - 136