Recurrent Neural Network Language Model with Part-of-speech for Mandarin Speech Recognition

Cited by: 0
Authors
Gong, Caixia [1]
Li, Xiangang [1]
Wu, Xihong [1]
Affiliations
[1] Peking Univ, Speech & Hearing Res Ctr, Key Lab Machine Percept, Minist Educ, Beijing 100871, Peoples R China
Source
2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP) | 2014
Keywords
part-of-speech; recurrent neural network language model; speech recognition
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recurrent neural network language models (RNNLMs) have been successfully applied in a variety of language processing applications, ranging from speech recognition to machine translation. They can fight the curse of dimensionality by learning a distributed representation (word vector) for each word. The components of these vectors measure the co-occurrence of the word with context features over a corpus. However, RNNLMs ignore the fact that the meaning of a word can vary substantially across contexts (e.g., for polysemous words). In this paper, we investigate using part-of-speech information to address this issue to some extent, since part-of-speech tags provide information about the meaning of a word. Experimental results on a Mandarin speech recognition task show that a significant character error rate reduction of 1.18% absolute (7.72% relative) was obtained when using a recurrent neural network language model with part-of-speech information.
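The two reduction figures in the abstract are linked by simple arithmetic: relative reduction = absolute reduction / baseline error rate. The short Python sketch below works backwards from the reported 1.18% absolute and 7.72% relative to an implied baseline. The baseline and improved error rates it prints are derived estimates, not values reported in this record, and the function name `implied_baseline` is hypothetical.

```python
def implied_baseline(abs_reduction: float, rel_reduction: float) -> float:
    """Return the baseline error rate implied by an absolute reduction
    (in percentage points) and a relative reduction (as a fraction)."""
    return abs_reduction / rel_reduction


# Figures taken from the abstract.
abs_red = 1.18    # absolute reduction, percentage points
rel_red = 0.0772  # relative reduction, 7.72% expressed as a fraction

# Derived values (assumptions, not reported in the record).
baseline_cer = implied_baseline(abs_red, rel_red)
improved_cer = baseline_cer - abs_red

print(f"implied baseline CER: {baseline_cer:.2f}%")
print(f"implied CER with part-of-speech: {improved_cer:.2f}%")
```

Under these assumptions the baseline character error rate comes out at roughly 15.3% and the part-of-speech system at roughly 14.1%; both are subject to rounding in the reported figures.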
Pages: 459-463
Page count: 5