An reinforcement learning-based speech censorship chatbot system

被引:9
作者
Cai, Shaokang [1 ]
Han, Dezhi [1 ]
Li, Dun [1 ,3 ]
Zheng, Zibin [2 ]
Crespi, Noel [3 ]
机构
[1] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
[2] Sun Yat Sen Univ, Sch Software Engn, Zhuhai 519082, Peoples R China
[3] Inst Polytech Paris, IMT, Telecom SudParis, F-91000 Paris, France
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Chatbots; Reinforcement Learning; Speech Censorship; Bi-GRU; ARTIFICIAL-INTELLIGENCE; OFFENSIVE LANGUAGE; MODEL;
D O I
10.1007/s11227-021-04251-z
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid development of artificial intelligence (AI) technology has enabled large-scale AI applications to land in the market and practice. However, plenty of security issues have been exposed to society while AI technology has brought many conveniences to humankind, especially for the chatbot with online learning. This paper proposes a speech censorship chatbot system with reinforcement learning, which is mainly composed of two parts: the aggressive speech censorship model and the speech purification model. The aggressive speech censorship can combine the context of user input sentences to detect aggressive speech and respond to the rapid evolution of aggressive speech. According to the situation of the chatbot that is polluted by large numbers of aggressive speech, the speech purification model has the capacity to "forget" the learned malicious data through reinforcement learning rather than rolling back to the early versions. In addition, by integrating few-shot learning, the speed of speech purification is accelerated while reducing the influence on the quality of replies. The experimental results show that our proposed method reduces the probability of generating aggressive speeches and that the integration of the few-shot learning improves the training speed rapidly while effectively slowing down the decline in BLEU values.
引用
收藏
页码:8751 / 8773
页数:23
相关论文
共 34 条
  • [1] Abel D., 2017, ARXIV170104079
  • [2] Adamopoulou E., 2020, ARTIF INTELL, P373, DOI DOI 10.1007/978-3-030-49186-4_31
  • [3] Detecting sentences that may be harmful to children with special needs
    Allouch, Merav
    Azaria, Amos
    Azoulay, Rina
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1209 - 1213
  • [4] Asghar, 2016, ARXIV PREPRINT ARXIV
  • [5] Chang C-C, MFVTAN ANOMALY TRAFF
  • [6] Chkroun, 2018, WORKSH 32 AAAI C ART, P695
  • [7] Dadvar Maral, 2013, Advances in Information Retrieval. 35th European Conference on IR Research, ECIR 2013. Proceedings, P693, DOI 10.1007/978-3-642-36973-5_62
  • [8] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [9] Dinakar K, 2011, P INT AAAI C WEB SOC, VWS-11-02, P11, DOI DOI 10.1609/ICWSM.V5I3.14209
  • [10] Du JC, 2017, 2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), P445, DOI 10.1109/SPAC.2017.8304320