Reproducing a Neural Question Answering Architecture Applied to the SQuAD Benchmark Dataset: Challenges and Lessons Learned

被引：2

作者：

Duer, Alexander ^{[1
]}

Rauber, Andreas ^{[1
]}

Filzmoser, Peter ^{[1
]}

机构：

[1] Vienna Univ Technol, Vienna, Austria

来源：

ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018) | 2018年 / 10772卷

关键词：

D O I：

10.1007/978-3-319-76941-7_8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reproducibility is one of the pillars of scientific research. This study attempts to reproduce the Gated Self-Matching Network, which is the basis of one of the best performing models on the SQuAD dataset. We reimplement the neural network model and highlight ambiguities in the original architectural description. We show that due to uncertainty about only two components of the neural network model and no precise description of the training process, it is not possible to reproduce the experimental results obtained by the original implementation. Finally we summarize what we learned from this reproduction process about writing precise neural network architecture descriptions, providing our implementation as a basis for future exploration.

引用

页码：102 / 113

页数：12

共 36 条

[21] Applying empirical software engineering to software architecture: challenges and lessons learned
Davide Falessi
Muhammad Ali Babar
Giovanni Cantone
Philippe Kruchten
Empirical Software Engineering, 2010, 15 : 250 - 276
[22] Applying empirical software engineering to software architecture: challenges and lessons learned
Falessi, Davide
Babar, Muhammad Ali
Cantone, Giovanni
Kruchten, Philippe
EMPIRICAL SOFTWARE ENGINEERING, 2010, 15 (03) : 250 - 276
[23] ArchivalQA: A Large-scale Benchmark Dataset for Open-Domain Question Answering over Historical News Collections
Wang, Jiexin
Jatowt, Adam
Yoshikawa, Masatoshi
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 3025 - 3035
[24] Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework
Nandy, Abhilash
Sharma, Soumya
Maddhashiya, Shubham
Sachdeva, Kapil
Goyal, Pawan
Ganguly, Niloy
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4600 - 4609
[25] The Challenges in Developing an Online Applied Statistics Program: Lessons Learned at Penn State University
Young, Derek S.
Johnson, Glenn F.
Chow, Mosuk
Rosenberger, James L.
AMERICAN STATISTICIAN, 2015, 69 (03) : 213 - 220
[26] Lessons learned to enable question answering on knowledge graphs extracted from scientific publications: A case study on the coronavirus literature
Badenes-Olmedo, Carlos
Corcho, Oscar
JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 142
[27] Revisiting a QoE Assessment Architecture Six Years Later: Lessons Learned and Remaining Challenges
Dalal, Amy Csizmar
QUALITY OF SERVICE IN HETEROGENEOUS NETWORKS, 2009, 22 : 730 - 738
[28] Challenges and opportunities in online education in Architecture: Lessons learned for Post-Pandemic education
Asfour, Omar S.
Alkharoubi, Amer M.
AIN SHAMS ENGINEERING JOURNAL, 2023, 14 (09)
[29] Investigating the Use of Convolutional Neural Networks in Diagnosis of Microinvasive Carcinoma of the Breast: Lessons Learned in a Small Dataset
Gayhart, Matthew
Cartagena, Lorraine Colon
Zot, Patricija
Robila, Valentina
MODERN PATHOLOGY, 2020, 33 (SUPPL 2) : 148 - 149
[30] Administration of the American Board of Anesthesiology's virtual APPLIED Examination: successes, challenges, and lessons learned
Keegan, Mark T.
Harman, Ann E.
McLoughlin, Thomas M.
Macario, Alex
Deiner, Stacie G.
Gaiser, Robert R.
Warner, David O.
Suresh, Santhanam
Sun, Huaping
BMC MEDICAL EDUCATION, 2024, 24 (01)

← 1 2 3 4 →