Learning to Generate Questions by Learning to Recover Answer-containing Sentences

被引:0
|
作者
Back, Seohyun [1 ,2 ]
Kedia, Akhil [1 ]
Chinthakindi, Sai Chetan [1 ]
Lee, Haejun [1 ]
Choo, Jaegul [2 ]
机构
[1] Samsung Res, Seoul, South Korea
[2] Korea Adv Inst Sci & Technol, Daejeon, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To train a question answering model based on machine reading comprehension (MRC), significant effort is required to prepare annotated training data composed of questions and their answers from contexts. Recent research has focused on synthetically generating a question from a given context and an annotated (or generated) answer by training an additional generative model to augment the training data. In light of this research direction, we propose a novel pre-training approach that learns to generate contextually rich questions, by recovering answer-containing sentences. We evaluate our method against existing ones in terms of the quality of generated questions, and fine-tuned MRC model accuracy after training on the data synthetically generated by our method. We consistently improve the question generation capability of existing models such as T5 and UniLM, and achieve state-of-the-art results on MS MARCO and NewsQA, and comparable results to the state-of-the-art on SQuAD. Additionally, the data synthetically generated by our approach is beneficial for boosting up the downstream MRC accuracy across a wide range of datasets, such as SQuAD-v1.1, v2.0, KorQuAD and BioASQ, without any modification to the existing MRC models. Furthermore, our method shines especially when a limited amount of pre-training or downstream MRC data is given.
引用
收藏
页码:1516 / 1529
页数:14
相关论文
共 50 条
  • [21] Answer extraction for definition questions using information gain and machine learning
    Martinez-Gil, Carmen
    Lopez-Lopez, A.
    ARTIFICIAL INTELLIGENCE IN THEORY AND PRACTICE II, 2008, 276 : 141 - 150
  • [22] Learning to Answer Questions from Image Using Convolutional Neural Network
    Ma, Lin
    Lu, Zhengdong
    Li, Hang
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 3567 - 3573
  • [23] Answer Extraction for Definition Questions Using Information Gain and Machine Learning
    Instituto Nacional de Astrofísica Óptica y Electrónica, Universidad de la Sierra Juárez, Mexico
    不详
    72840, Mexico
    IFIP Advances in Information and Communication Technology, 2008, (141-150)
  • [24] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
    Yang, Antoine
    Miech, Antoine
    Sivic, Josef
    Laptev, Ivan
    Schmid, Cordelia
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1666 - 1677
  • [25] Learning from Unannotated QA Pairs to Analogically Disambiguate and Answer Questions
    Crouse, Maxwell
    McFate, Clifton
    Forbus, Kenneth
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 654 - 662
  • [26] Learning to Answer Complex Questions over Knowledge Bases with Query Composition
    Bhutani, Nikita
    Zheng, Xinyi
    Jagadish, H. V.
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 739 - 748
  • [27] Learning to complete sentences
    Bickel, S
    Haider, P
    Scheffer, T
    MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 497 - 504
  • [28] GVQA: Learning to Answer Questions about Graphs with Visualizations via Knowledge Base
    Song, Sicheng
    Chen, Juntong
    Li, Chenhui
    Wang, Changbo
    PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2023, 2023,
  • [29] Learning to answer programming questions with software documentation through social context embedding
    Li, Jing
    Sun, Aixin
    Xing, Zhenchang
    INFORMATION SCIENCES, 2018, 448 : 36 - 52
  • [30] Learning to ask and answer important questions: An investigative laboratory for general chemistry.
    Lloyd, BW
    Sarquis, AM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2000, 219 : U427 - U428