PARAPHRASTIC NEURAL NETWORK LANGUAGE MODELS

被引:0
作者
Liu, X. [1 ]
Gales, M. J. F. [1 ]
Woodland, P. C. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
neural network language model; paraphrase; speech recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Expressive richness in natural languages presents a significant challenge for statistical language models (LM). As multiple word sequences can represent the same underlying meaning, only modelling the observed surface word sequence can lead to poor context coverage. To handle this issue, paraphrastic LMs were previously proposed to improve the generalization of back-off n-gram LMs. Paraphrastic neural network LMs (NNLM) are investigated in this paper. Using a paraphrastic multi-level feedforward NNLM modelling both word and phrase sequences, significant error rate reductions of 1.3% absolute (8% relative) and 0.9% absolute (5.5% relative) were obtained over the baseline n-gram and NNLM systems respectively on a state-of-the-art conversational telephone speech recognition system trained on 2000 hours of audio and 545 million words of texts.
引用
收藏
页数:5
相关论文
共 28 条
  • [1] A Survey of Paraphrasing and Textual Entailment Methods
    Androutsopoulos, Ion
    Malakasiotis, Prodromos
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2010, 38 : 135 - 187
  • [2] A neural probabilistic language model
    Bengio, Y
    Ducharme, R
    Vincent, P
    Jauvin, C
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) : 1137 - 1155
  • [3] Brown P. F., 1992, Computational Linguistics, V18, P467
  • [4] Bulyko I., 2003, P HLT2003 EDM CAN
  • [5] Cao G., 2005, SIGIR 2005. Proceedings of the Twenty-Eighth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P298, DOI 10.1145/1076034.1076086
  • [6] Dekang Lin, 2001, KDD-2001. Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P323
  • [7] Evermann G, 2005, INT CONF ACOUST SPEE, P209
  • [8] Fellbaum C., 1998, WordNet, DOI DOI 10.7551/MITPRESS/7287.001.0001
  • [9] DISTRIBUTIONAL STRUCTURE
    Harris, Zellig S.
    [J]. WORD-JOURNAL OF THE INTERNATIONAL LINGUISTIC ASSOCIATION, 1954, 10 (2-3): : 146 - 162
  • [10] Hobeiman R., 2002, USING WORDNET SUPPLE