Graph augmented sequence-to-sequence model for neural question generation

被引:0
作者
Hui Ma
Jian Wang
Hongfei Lin
Bo Xu
机构
[1] Dalian University of Technology,School of Computer Science and Technology
来源
Applied Intelligence | 2023年 / 53卷
关键词
Question generation; Sequence-to-sequence; Graph neural network; Recurrent neural network; Answer information;
D O I
暂无
中图分类号
学科分类号
摘要
Neural question generation (NQG) aims to generate a question from a given passage with neural networks. NQG has attracted more attention in recent years, due to its wide applications in reading comprehension, question answering, and dialogue systems. Existing works on NQG mainly use the sequence-to-sequence (Seq2Seq) or graph-to-sequence (Graph2Seq) framework. The former ignores rich structure information of the passage, while the latter is insufficient in modeling semantic information. Moreover, the target answer plays an important role in the task, because without the answer the generated question has great randomness. To effectively utilize answer information and capture both structure and semantic information of the passage, we propose a graph augmented sequence-to-sequence (GA-Seq2Seq) model. Firstly, we design an answer-aware passage representation module to integrate the answer information into the passage. Then, to discover both the structure and semantic information of the passage, we present a graph augmented passage encoder which consists of a graph encoder and a sequence encoder. Finally, we leverage an attention-based long short-term memory decoder to generate the question. Experimental results on the SQuAD and MS MARCO datasets show that our proposed model outperforms the existing state-of-the-art baselines in terms of automatic and human evaluations. The implementation is available at https://github.com/butterfliesss/GA-Seq2Seq.
引用
收藏
页码:14628 / 14644
页数:16
相关论文
共 8 条
[1]  
Zeng H(2021)Improving paragraph-level question generation with extended answer network and uncertainty-aware beam search Inf Sci 571 50-64
[2]  
Zhi Z(2005)Framewise phoneme classification with bidirectional lstm and other neural network architectures Neural Netw 18 602-610
[3]  
Liu J(1997)Long short-term memory Neural Computat 9 1735-1780
[4]  
Wei B(undefined)undefined undefined undefined undefined-undefined
[5]  
Graves A(undefined)undefined undefined undefined undefined-undefined
[6]  
Schmidhuber J(undefined)undefined undefined undefined undefined-undefined
[7]  
Hochreiter S(undefined)undefined undefined undefined undefined-undefined
[8]  
Schmidhuber J(undefined)undefined undefined undefined undefined-undefined