Coarse-grained decomposition and fine-grained interaction for multi-hop question answering

被引:13
作者
Cao, Xing [1 ,2 ]
Liu, Yun [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
[2] Beijing Municipal Commiss Educ, Key Lab Commun & Informat Syst, Beijing 100044, Peoples R China
关键词
Question answering; Complex questions; Coarse-grained complex question decomposition; Fine-grained interaction;
D O I
10.1007/s10844-021-00645-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, question answering (QA) and reading comprehension (RC) has attracted much attention, and most research on QA has focused on multi-hop QA task which requires connecting multiple pieces of evidence scattered in a long context to answer the question. The key to the multi-hop QA task is semantic feature interaction between documents and questions, which is widely processed by Bi-directional Attention Flow (Bi-DAF), but Bi-DAF generally captures only the surface semantics of words in complex questions, and fails to capture implied semantic feature of intermediate answers, as well as ignoring parts of contexts related to the question and failing to extract the most important parts of multiple documents. In this paper, we propose a new model architecture for multi-hop question answering by applying two completion strategies:(1) Coarse-Grained complex question Decomposition (CGDe) strategy is introduced to decompose complex questions into simple ones without any additional annotations; (2) Fine-Grained Interaction (FGIn) strategy is introduced to explicitly represent each word in documents and extract more comprehensive and accurate sentences related to the inference path. The above two strategies are combined and tested on the SQuAD and HotpotQA datasets, and the experimental results show that our method outperforms state-of-the-art baselines.
引用
收藏
页码:21 / 41
页数:21
相关论文
共 42 条
[1]  
Bhargav GPS, 2020, AAAI CONF ARTIF INTE, V34, P7700
[2]  
Cao Y, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P357
[3]   Reading Wikipedia to Answer Open-Domain Questions [J].
Chen, Danqi ;
Fisch, Adam ;
Weston, Jason ;
Bordes, Antoine .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1870-1879
[4]  
CHEN H, 2017, ACM SIGKDD EXPLOR NE, V19, P25, DOI [DOI 10.1145/3166054.3166058, 10.1145/3166054.3166058]
[5]  
Chung J., 2014, ARXIV
[6]  
cisky Tomas Ko., 2018, Transactions of the Association for Computational Linguistics, V6, P317, DOI DOI 10.1162/TACLA00023
[7]   Attention-over-Attention Neural Networks for Reading Comprehension [J].
Cui, Yiming ;
Chen, Zhipeng ;
Wei, Si ;
Wang, Shijin ;
Liu, Ting ;
Hu, Guoping .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :593-602
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]   A survey on question answering systems over linked data and documents [J].
Dimitrakis, Eleftherios ;
Sgontzos, Konstantinos ;
Tzitzikas, Yannis .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2020, 55 (02) :233-259
[10]  
Fang YW, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P8823