Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning

被引:0
作者
Tan, Qingyu [1 ,2 ]
Ng, Hwee Tou [2 ]
Bing, Lidong [1 ]
机构
[1] DAMO Acad, Alibaba Grp, Beijing, Peoples R China
[2] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024 | 2024年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge in the real world is being updated constantly. However, it is costly to frequently update large language models (LLMs). Therefore, it is crucial for LLMs to understand the concept of temporal knowledge. However, prior works on temporal question answering (TQA) did not emphasize multi-answer and multi-hop types of temporal reasoning. In this paper, we propose a complex temporal question-answering dataset Complex-TR that focuses on multi-answer and multi-hop temporal reasoning. Besides, we also propose a novel data augmentation strategy to improve the complex temporal reasoning capability and robustness of LLMs. We conducted experiments on multiple temporal QA datasets. Experimental results show that our method is able to improve LLMs' performance on temporal QA benchmarks by significant margins(1).
引用
收藏
页码:6272 / 6286
页数:15
相关论文
共 45 条
[1]  
2023, Arxiv, DOI [arXiv:2303.08774, DOI 10.48550/ARXIV.2303.08774, 10.48550/arXiv.2303.08774]
[2]  
Amouyal SJ, 2022, Arxiv, DOI arXiv:2205.12665
[3]   Multi-hop reasoning over paths in temporal knowledge graphs using reinforcement learning [J].
Bai, Luyi ;
Yu, Wenting ;
Chen, Mingzhuo ;
Ma, Xiangnan .
APPLIED SOFT COMPUTING, 2021, 103
[4]  
Bai Luyi, 2023, Expert Sys-tems with Applications
[5]  
Bajaj P, 2016, arXiv preprint arXiv: 1611.09268
[6]  
Barbaresi A, 2021, ACL-IJCNLP 2021: THE JOINT CONFERENCE OF THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, P122
[7]  
Cao Hannan, 2023, P EAC
[8]  
Cao Hannan, 2023, P EAC
[9]  
Cassidy Taylor, 2014, P ACL
[10]  
Chen Wenhu, 2021, P NEURIP