Operation-Augmented Numerical Reasoning for Question Answering

被引：0

作者：

Zhou, Yongwei ^{[1
]}

Bao, Junwei ^{[2
]}

Wu, Youzheng ^{[2
]}

He, Xiaodong ^{[2
]}

Zhao, Tiejun ^{[1
]}

机构：

[1] Harbin Inst Technol, Sch Comp Sci & Technol, Machine Intelligence & Translat Lab, Harbin 150001, Peoples R China

[2] JD AI Res, Beijing 101111, Peoples R China

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2024年 / 32卷

基金：

中国国家自然科学基金;

关键词：

Cognition; Task analysis; Semantics; Speech processing; Sorting; Question answering (information retrieval); Predictive models; Numerical reasoning; symbolic operations; semantic augmentation; mixture-of-experts;

D O I：

10.1109/TASLP.2023.3316448

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Question answering requiring numerical reasoning, which generally involves symbolic operations such as sorting, counting, and addition, is a challenging task. To address such a problem, existing mixture-of-experts (MoE)-based methods design several specific answer predictors to handle different types of questions and achieve promising performance. However, they ignore the modeling and exploitation of fine-grained reasoning-related operations to support numerical reasoning, encountering the inadequacy in reasoning capability and interpretability. To alleviate this issue, we propose OPERA, an operation-augmented numerical reasoning framework. Concretely, we systematically define a scalable operation set to model numerical reasoning. We first identify reasoning-related operations based on context and then softly execute them to imitate the answer reasoning procedure via an operation-aware cross-attention mechanism. Finally, we utilize the operation-augmented semantic representation of execution results to support answer prediction. We verify the effectiveness and generalization of OPERA in two scenarios with different knowledge sources and reasoning capabilities. Specifically, we conduct extensive experiments on two textual datasets, DROP and RACENum, and a table-text hybrid dataset TAT-QA. Experiment results show that OPERA outperforms previous strong methods on the DROP, RACENum, and TAT-QA datasets. Further, we statistically and visually analyze its interpretability.

引用

页码：15 / 28

页数：14

共 63 条

[1]

Andor D, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P5947

[2] Neural Module Networks [J].

Andreas, Jacob ;

Rohrbach, Marcus ;

Darrell, Trevor ;

Klein, Dan .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :39-48

[3]

Ba J, 2014, ACS SYM SER

[4]

Ba J.L., 2016, arXiv preprint arXiv:1607.06450, DOI DOI 10.48550/ARXIV.1607.06450

[5]

Bao J., 2016, P COLING 2016 26 INT, P2503

[6]

Cai Q., 2013, P 51 ANN M ASS COMP, P423

[7]

Chen KL, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6759

[8]

Chen W., 2020, P INT C LEARN REPR

[9]

Chen WH, 2023, Arxiv, DOI arXiv:2211.12588

[10]

Chen WH, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, P1026

← 1 2 3 4 5 6 7 →