Multi-Column Convolutional Neural Networks with Causality-Attention for Why-Question Answering

被引：33

作者：

Oh, Jong-Hoon ^{[1
]}

Torisawa, Kentaro ^{[1
]}

Kruengkrai, Canasai ^{[1
]}

Iida, Ryu ^{[1
]}

Kloetzer, Julien ^{[1
]}

机构：

[1] Natl Inst Informat & Commun Technol, Kyoto, Japan

来源：

WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING | 2017年

关键词：

Question Answering; Convolutional Neural Network; Neural Attention; Causality; Why-Question Answering;

D O I：

10.1145/3018661.3018737

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Why-question answering (why-QA) is a task to retrieve answers (or answer passages) to why-questions (e.g., "why are tsunamis generated?") from a text archive. Several previously proposed methods for why-QA improved their performance by automatically recognizing causalities that are expressed with such explicit cues as "because" in answer passages and using the recognized causalities as a clue for finding proper answers. However, in answer passages, causalities might be implicitly expressed, (i.e., without any explicit cues): "An earthquake suddenly displaced sea water and a tsunami was generated." The previous works did not deal with such implicitly expressed causalities and failed to find proper answers that included the causalities. We improve why-QA based on the following two ideas. First, implicitly expressed causalities in one text might be expressed in other texts with explicit cues. If we can automatically recognize such explicitly expressed causalities from a text archive and use them to complement the implicitly expressed causalities in an answer passage, we can improve why-QA. Second, the causes of similar events tend to be described with a similar set of words (e.g., "seismic energy" and "tectonic plates" for "the Great East Japan Earthquake" and "the 1906 San Francisco Earthquake"). As such, even if we cannot find in a text archive any explicitly expressed cause of an event (e.g., "the Great East Japan Earthquake") expressed in a question (e.g., "Why did the Great East Japan earthquake happen?"), we might be able to identify its implicitly expressed causes with a set of words (e.g., "tectonic plates") that appear in the explicitly expressed cause of a similar event (e.g., "the 1906 San Francisco Earthquake"). We implemented these two ideas in our multi-column convolutional neural networks with a novel attention mechanism, which we call causality attention. Through experiments on Japanese why-QA, we confirmed that our proposed method outperformed the state-of-the-art systems.

引用

页码：415 / 424

页数：10

共 20 条

[1] Causality Analysis Method and Model Related to Why-Question Answering in Business Intelligence Context
Guessoum, Meriem Amel
Djiroun, Rahma
Boukhalfa, Kamel
ADVANCES IN COMPUTING SYSTEMS AND APPLICATIONS, 2022, 513 : 15 - 26
[2] Application of Multi-Column Heterogeneous Convolutional Neural Networks in image classification
Wang, Guo-Zhen
JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2019, 19 (02) : 307 - 316
[3] Incorporating Statistical Features in Convolutional Neural Networks for Question Answering with Financial Data
Shijia, E.
Xu, Shiyao
Xiang, Yang
COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1955 - 1959
[4] Multi-Column Atrous Convolutional Neural Network for Counting Metro Passengers
Zhang, Jun
Zhu, Gaoyi
Wang, Zhizhong
SYMMETRY-BASEL, 2020, 12 (04):
[5] Multi-scale and multi-column convolutional neural network for crowd density estimation
Chen, Lei
Wang, Guodong
Hou, Guojia
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (05) : 6661 - 6674
[6] Multi-image Crowd Counting Using Multi-column Convolutional Neural Network
Kurnaz, Oguzhan
Hanilci, Cemal
PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 : 223 - 232
[7] Multi-scale and multi-column convolutional neural network for crowd density estimation
Lei Chen
Guodong Wang
Guojia Hou
Multimedia Tools and Applications, 2021, 80 : 6661 - 6674
[8] Smart connected electronic gastroscope system for gastric cancer screening using multi-column convolutional neural networks
Wang, Hao
Ding, Shuai
Wu, Desheng
Zhang, Youtao
Yang, Shanlin
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2019, 57 (21) : 6795 - 6806
[9] Adversarial Entity Graph Convolutional Networks for multi-hop inference question answering
Du, Yongping
Yan, Rui
Hou, Ying
Pei, Yu
Han, Honggui
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
[10] Image-Based Crowd Stability Analysis Using Improved Multi-Column Convolutional Neural Network
Zhao, Rongyong
Dong, Daheng
Wang, Yan
Li, Cuiling
Ma, Yunlong
Fuentes Enriquez, Veronica
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) : 5480 - 5489

← 1 2 →