Reproducing a Neural Question Answering Architecture Applied to the SQuAD Benchmark Dataset: Challenges and Lessons Learned

Cited by: 2
Authors
Duer, Alexander [1]
Rauber, Andreas [1]
Filzmoser, Peter [1]
Affiliations
[1] Vienna Univ Technol, Vienna, Austria
Source
ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018) | 2018 / Volume 10772
DOI
10.1007/978-3-319-76941-7_8
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Reproducibility is one of the pillars of scientific research. This study attempts to reproduce the Gated Self-Matching Network, which is the basis of one of the best-performing models on the SQuAD dataset. We reimplement the neural network model and highlight ambiguities in the original architectural description. We show that, due to uncertainty about only two components of the neural network model and the lack of a precise description of the training process, it is not possible to reproduce the experimental results obtained by the original implementation. Finally, we summarize what we learned from this reproduction process about writing precise neural network architecture descriptions, and we provide our implementation as a basis for future exploration.
Pages: 102-113
Number of pages: 12
Related papers
36 in total
  • [21] Applying empirical software engineering to software architecture: challenges and lessons learned
    Davide Falessi
    Muhammad Ali Babar
    Giovanni Cantone
    Philippe Kruchten
    Empirical Software Engineering, 2010, 15 : 250 - 276
  • [22] Applying empirical software engineering to software architecture: challenges and lessons learned
    Falessi, Davide
    Babar, Muhammad Ali
    Cantone, Giovanni
    Kruchten, Philippe
    EMPIRICAL SOFTWARE ENGINEERING, 2010, 15 (03) : 250 - 276
  • [23] ArchivalQA: A Large-scale Benchmark Dataset for Open-Domain Question Answering over Historical News Collections
    Wang, Jiexin
    Jatowt, Adam
    Yoshikawa, Masatoshi
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 3025 - 3035
  • [24] Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework
    Nandy, Abhilash
    Sharma, Soumya
    Maddhashiya, Shubham
    Sachdeva, Kapil
    Goyal, Pawan
    Ganguly, Niloy
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4600 - 4609
  • [25] The Challenges in Developing an Online Applied Statistics Program: Lessons Learned at Penn State University
    Young, Derek S.
    Johnson, Glenn F.
    Chow, Mosuk
    Rosenberger, James L.
    AMERICAN STATISTICIAN, 2015, 69 (03) : 213 - 220
  • [26] Lessons learned to enable question answering on knowledge graphs extracted from scientific publications: A case study on the coronavirus literature
    Badenes-Olmedo, Carlos
    Corcho, Oscar
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 142
  • [27] Revisiting a QoE Assessment Architecture Six Years Later: Lessons Learned and Remaining Challenges
    Dalal, Amy Csizmar
    QUALITY OF SERVICE IN HETEROGENEOUS NETWORKS, 2009, 22 : 730 - 738
  • [28] Challenges and opportunities in online education in Architecture: Lessons learned for Post-Pandemic education
    Asfour, Omar S.
    Alkharoubi, Amer M.
    AIN SHAMS ENGINEERING JOURNAL, 2023, 14 (09)
  • [29] Investigating the Use of Convolutional Neural Networks in Diagnosis of Microinvasive Carcinoma of the Breast: Lessons Learned in a Small Dataset
    Gayhart, Matthew
    Cartagena, Lorraine Colon
    Zot, Patricija
    Robila, Valentina
    MODERN PATHOLOGY, 2020, 33 (SUPPL 2) : 148 - 149
  • [30] Administration of the American Board of Anesthesiology's virtual APPLIED Examination: successes, challenges, and lessons learned
    Keegan, Mark T.
    Harman, Ann E.
    McLoughlin, Thomas M.
    Macario, Alex
    Deiner, Stacie G.
    Gaiser, Robert R.
    Warner, David O.
    Suresh, Santhanam
    Sun, Huaping
    BMC MEDICAL EDUCATION, 2024, 24 (01)