Knowledge Graph-Based Reinforcement Federated Learning for Chinese Question and Answering

被引:5
作者
Xu, Liang [1 ]
Chen, Tao [2 ]
Hou, Zhaoxiang [3 ]
Zhang, Weishan [2 ]
Hon, Chitin [4 ]
Wang, Xiao [5 ]
Wang, Di [4 ]
Chen, Long [6 ]
Zhu, Wenyin [7 ,8 ]
Tian, Yunlong [7 ,8 ]
Ning, Huansheng [1 ]
Wang, Fei-Yue [9 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100000, Peoples R China
[2] China Univ Petr, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China
[3] ENN Grp, Digital Res Inst, Langfang 065001, Peoples R China
[4] Macau Univ Sci & Technol, Fac Innovat Engn, Macau, Peoples R China
[5] Anhui Univ, Sch Artificial Intelligence, Hefei 266114, Anhui, Peoples R China
[6] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
[7] Inst State Key Lab Digital Household Appliances, Qingdao 266101, Peoples R China
[8] Haier Grp, Technol Res & Dev Ctr, Qingdao 266101, Peoples R China
[9] Chinese Acad Sci, Inst Automat, Beijing 100000, Peoples R China
基金
中国国家自然科学基金;
关键词
Data models; Semantics; Federated learning; Knowledge graphs; Data privacy; Transformers; Task analysis; Knowledge graph; multitask semantic parsing MSP-bidirectional and auto-regressive transformers (BART); prompt learning; question and answering (Q&A); reinforcement federated learning (RFL);
D O I
10.1109/TCSS.2023.3246795
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Knowledge question and answering (Q & A) is widely used. However, most existing semantic parsing methods in Q & A usually use cascading, which can incur error accumulation. In addition, using only one institution's Q & A data definitely will limit the Q & A performance, while data privacy prevents sharing between institutions. This article proposes a knowledge graph-based reinforcement federated learning (KGRFL)-based Q & A approach to address these challenges. We design an end-to-end multitask semantic parsing model MSP-bidirectional and auto-regressive transformers (BART) that identifies question categories while converting questions into SPARQL statements to improve semantic parsing. Meanwhile, a reinforcement learning (RL)-based model fusion strategy is proposed to improve the effectiveness of federated learning, which enables multi-institution joint modeling and data privacy protection using cross-domain knowledge. In particular, it also reduces the negative impact of low-quality clients on the global model. Furthermore, a prompt learning-based entity disambiguation method is proposed to address the semantic ambiguity problem because of joint modeling. The experiments show that the proposed method performs well on different datasets. The Q & A results of the proposed approach outperform the approach of using only a single institution. Experiments also demonstrate that the proposed approach is resilient to security attacks, which is required for real applications.
引用
收藏
页码:1035 / 1045
页数:11
相关论文
共 42 条
[1]  
Andrychowicz Marcin, 2017, Advances in neural information processing systems, V30
[2]  
Cao S., 2020, ARXIV
[3]   Probabilistic Analysis of Electric Vehicle Energy Consumption Using MPC Speed Control and Nonlinear Battery Model [J].
Chen, Jun ;
Liang, Man ;
Ma, Xu .
2021 13TH ANNUAL IEEE GREEN TECHNOLOGIES CONFERENCE GREENTECH 2021, 2021, :181-186
[4]  
Chen Q., 2016, ARXIV
[5]  
Chen Q., 2019, ARXIV
[6]  
Devlin Jacob, 2018, 181004805 ARXIV
[7]  
Fortunato Meire, 2017, arXiv
[8]  
Hao Y, 2019, AAAI CONF ARTIF INTE, P938
[9]  
Huang PS, 2013, PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), P2333
[10]  
Konda VR, 2000, ADV NEUR IN, V12, P1008