Recurrent Neural Network Language Model with Part-of-speech for Mandarin Speech Recognition

被引：0

作者：

Gong, Caixia ^{[1
]}

Li, Xiangang ^{[1
]}

Wu, Xihong ^{[1
]}

机构：

[1] Peking Univ, Speech & Hearing Res Ctr, Key Lab Machine Percept, Minist Educ, Beijing 100871, Peoples R China

来源：

2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014年

关键词：

part-of-speech; recurrent neural network language model; speech recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recurrent neural network language models (RNNLMs) have been successfully applied in a variety of language processing applications ranging from speech recognition to machine translation. They can fight the curse of dimensionality by learning a distributed representation (word vector). The components of these vectors measure the co-occurrence of the word with context features over a corpus. However, RNNLMs ignore the fact that the meaning of word can vary substantially in different contexts (e.g., for polysemous words). In this paper, we investigate part-of-speech information to address this issue to some extent on the basis of information about the meaning of a word they could provide. Experimental results on Mandarin speech recognition task show that a significant character error reduction of 1.18% absolute (7.72% relative) was obtained when using recurrent neural network language model with part-of-speech.

引用

页码：459 / 463

页数：5

共 14 条

[1]

[Anonymous], 2007, P 24 INT C MACH LEAR, DOI DOI 10.1145/1273496.1273577

[2] A neural probabilistic language model [J].

Bengio, Y ;

Ducharme, R ;

Vincent, P ;

Jauvin, C .

JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1137-1155

[3]

Bilmes JeffA., 2003, P C HUMAN LANGUAGE T, P4, DOI DOI 10.3115/1073483.1073485

[4]

Brown P. F., 1992, Computational Linguistics, V18, P467

[5] Structured language modeling [J].

Chelba, C ;

Jelinek, F .

COMPUTER SPEECH AND LANGUAGE, 2000, 14 (04) :283-332

[6]

KNESER R, 1995, INT CONF ACOUST SPEE, P181, DOI 10.1109/ICASSP.1995.479394

[7]

Mikolov T., 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), P196, DOI 10.1109/ASRU.2011.6163930

[8]

Mikolov T, 2011, 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, P612

[9]

Mikolov T, 2010, 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, P1045

[10]

Mikolov T, 2011, INT CONF ACOUST SPEE, P5528

← 1 2 →