Nonfactoid Question Answering as Query-Focused Summarization With Graph-Enhanced Multihop Inference

被引:25
作者
Deng, Yang [1 ]
Zhang, Wenxuan [2 ]
Xu, Weiwen [1 ]
Shen, Ying [3 ]
Lam, Wai [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Peoples R China
[2] DAMO Acad, Alibaba Grp, Singapore 189554, Singapore
[3] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510275, Peoples R China
基金
中国国家自然科学基金;
关键词
Cognition; Semantics; Spread spectrum communication; Question answering (information retrieval); Coherence; Aggregates; Task analysis; Graph neural network; multihop reasoning; nonfactoid question answering (QA); query-focused summarization;
D O I
10.1109/TNNLS.2023.3258413
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nonfactoid question answering (QA) is one of the most extensive yet challenging applications and research areas in natural language processing (NLP). Existing methods fall short of handling the long-distance and complex semantic relations between the question and the document sentences. In this work, we propose a novel query-focused summarization method, namely a graph-enhanced multihop query-focused summarizer (GMQS), to tackle the nonfactoid QA problem. Specifically, we leverage graph-enhanced reasoning techniques to elaborate the multihop inference process in nonfactoid QA. Three types of graphs with different semantic relations, namely semantic relevance, topical coherence, and coreference linking, are constructed for explicitly capturing the question-document and sentence-sentence interrelationships. Relational graph attention network (RGAT) is then developed to aggregate the multirelational information accordingly. In addition, the proposed method can be adapted to both extractive and abstractive applications as well as be mutually enhanced by joint learning. Experimental results show that the proposed method consistently outperforms both existing extractive and abstractive methods on two nonfactoid QA datasets, WikiHow and PubMedQA, and possesses the capability of performing explainable multihop reasoning.
引用
收藏
页码:11231 / 11245
页数:15
相关论文
共 81 条
  • [1] [Anonymous], 2018, P 27 INT C COMPUTATI
  • [2] Bauer L, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P4220
  • [3] Baumel T, 2016, AAAI CONF ARTIF INTE, P2573
  • [4] Bird S., 2009, Natural language processing with Python: analyzing text with the natural language toolkit
  • [5] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [6] Cao Z., 2016, P COLING 2016 26 INT
  • [7] Chao Shen, 2011, Proceedings of the 2011 IEEE 11th International Conference on Data Mining (ICDM 2011), P626, DOI 10.1109/ICDM.2011.91
  • [8] Chen YC, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P675
  • [9] Cheng JP, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P484
  • [10] Toward Personalized Answer Generation in E-Commerce via Multi-perspective Preference Modeling
    Deng, Yang
    Li, Yaliang
    Zhang, Wenxuan
    Ding, Bolin
    Lam, Wai
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, 40 (04)