Personalizing Recurrent-Neural-Network-Based Language Model by Social Network

被引：19

作者：

Lee, Hung-Yi ^{[1
]}

Tseng, Bo-Hsiang ^{[2
]}

Wen, Tsung-Hsien ^{[3
]}

Tsao, Yu

机构：

[1] Natl Taiwan Univ, Dept Elect Engn, Taipei 10617, Taiwan

[2] Natl Taiwan Univ, Taipei 10617, Taiwan

[3] Univ Cambridge, Dialogue Syst Grp, Cambridge CB2 1TN, England

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2017年 / 25卷 / 03期

关键词：

Personalized language modeling; recurrent neural network; social network; ADAPTATION; SYSTEM;

D O I：

10.1109/TASLP.2016.2635445

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

With the popularity of mobile devices, personalized speech recognizers have become more attainable and are highly attractive. Since each mobile device is used primarily by a single user, it is possible to have a personalized recognizer that well matches the characteristics of the individual user. Although acoustic model personalization has been investigated for decades, much less work has been reported on personalizing language models, presumably because of the difficulties in collecting sufficient personalized corpora. In this paper, we propose a general framework for personalizing recurrent-neural-network-based language models (RNNLMs) using data collected from social networks, including the posts of many individual users and friend relationships among the users. Two major directions for this are model-based and feature-based RNNLM personalization. In model-based RNNLM personalization, the RNNLM parameters are fine-tuned to an individual user's wording patterns by incorporating social texts posted by the target user and his or her friends. For the feature-based approach, the RNNLM model parameters are fixed across users, but the RNNLM input features are instead augmented with personalized information. Both approaches not only drastically reduce the model perplexity, but also moderately reduce word error rates in n-best rescoring tests.

引用

页码：519 / 530

页数：12

共 63 条

[51] From Feedforward to Recurrent LSTM Neural Networks for Language Modeling [J].

Sundermeyer, Martin ;

Ney, Hermann ;

Schlueter, Ralf .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (03) :517-529

[52]

Tam YC, 2007, INT CONF ACOUST SPEE, P41

[53]

Tam YC, 2006, INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, P2206

[54]

Tseng BH, 2015, 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), P84, DOI 10.1109/ASRU.2015.7404778

[55]

Tseng C.-Y., 2010, P INT C WORKSH TEFL, P60

[56] A model of a trust-based recommendation system on a social network [J].

Walter, Frank Edward ;

Battiston, Stefano ;

Schweitzer, Frank .

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2008, 16 (01) :57-74

[57]

Wen TH, 2013, INTERSPEECH, P2702

[58]

Wen TH, 2012, IEEE W SP LANG TECH, P188, DOI 10.1109/SLT.2012.6424220

[59]

Woodland P.C., 2001, ISCA Workshop on Adaptation, P11

[60]

Wu Jun, 1999, P EUR 1999, V99, P2179

← 1 2 3 4 5 6 7 →