Utterance Censorship of Online Reinforcement Learning Chatbot

被引:3
作者
Chai, Yixuan [1 ]
Liu, Guohua [1 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
来源
2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI) | 2018年
关键词
utterance censorship; reinforcement learning chatbot; character-level LSTM;
D O I
10.1109/ICTAI.2018.00063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Researchers have applied online deep reinforcement learning in order to enhance the open-domain conversational skills of chatbots. These chatbots have the ability to learn conversations from real users but in practical applications, some users may take advantage of the chatbot's online learning ability to generate offensive responses. In this paper, we introduce an utterance censorship system to check whether the chatbot's utterance is appropriate. If the speech is inappropriate, the censor will block it and give a negative reward to "punish" the chatbot. The censorship system is based on a character-level bidirectional LSTM model, and the chatbot receiving the reward from the censorship system "forgets" the learned offensive utterances. Experimental results show that our proposed architecture enables online learning chatbots to self purify and that character-level LSTM is more appropriate for the utterance censorship task compared with classical word-level LSTM model.
引用
收藏
页码:358 / 362
页数:5
相关论文
共 17 条
[1]  
[Anonymous], 2016, CONTINUOUSLY LEARNIN
[2]  
[Anonymous], DEEP ACTIVE LEARNING
[3]  
[Anonymous], 2011, P INT AAAI C WEB SOC
[4]  
[Anonymous], 2012 IEEE SENSORS AP
[5]  
[Anonymous], 2009, CONTENT ANAL WEB
[6]  
[Anonymous], RECURRENT NEURAL NET
[7]  
[Anonymous], 1997, Neural Computation
[8]  
[Anonymous], DEEP REINFORCEMENT L
[9]  
[Anonymous], SMOKEY AUTOMATIC REC
[10]  
[Anonymous], 2016, PROC C EMPIR METHODS