Reproducing a Neural Question Answering Architecture Applied to the SQuAD Benchmark Dataset: Challenges and Lessons Learned

Cited by: 2
Authors
Duer, Alexander [1 ]
Rauber, Andreas [1 ]
Filzmoser, Peter [1 ]
Affiliations
[1] Vienna Univ Technol, Vienna, Austria
Source
ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018) | 2018 / Vol. 10772
Keywords
DOI
10.1007/978-3-319-76941-7_8
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Reproducibility is one of the pillars of scientific research. This study attempts to reproduce the Gated Self-Matching Network, which is the basis of one of the best-performing models on the SQuAD dataset. We reimplement the neural network model and highlight ambiguities in the original architectural description. We show that, because of uncertainty about just two components of the model and the lack of a precise description of the training process, the experimental results obtained by the original implementation cannot be reproduced. Finally, we summarize what this reproduction process taught us about writing precise neural network architecture descriptions, and we provide our implementation as a basis for future exploration.
Pages: 102-113
Page count: 12
Related papers
36 in total
  • [31] Administration of the American Board of Anesthesiology's virtual APPLIED Examination: successes, challenges, and lessons learned
    Keegan, Mark T.
    Harman, Ann E.
    McLoughlin, Thomas M.
    Macario, Alex
    Deiner, Stacie G.
    Gaiser, Robert R.
    Warner, David O.
    Suresh, Santhanam
    Sun, Huaping
    BMC MEDICAL EDUCATION, 2024, 24 (01)
  • [32] Validating the Smart Grid Architecture Model for Sustainable Energy Community Implementation: Challenges, Solutions, and Lessons Learned
    Janev, Valentina
    Berbakov, Lazar
    Tomasevic, Nikola
    Sotoca, Jesus Martin-Borja
    Lujan, Sergio
    ENERGIES, 2025, 18 (03)
  • [33] Challenges Encountered and Lessons Learned when Using a Novel Anonymised Linked Dataset of Health and Social Care Records for Public Health Intelligence: The Sussex Integrated Dataset
    Ford, Elizabeth
    Tyler, Richard
    Johnston, Natalie
    Spencer-Hughes, Vicki
    Evans, Graham
    Elsom, Jon
    Madzvamuse, Anotida
    Clay, Jacqueline
    Gilchrist, Kate
    Rees-Roberts, Melanie
    INFORMATION, 2023, 14 (02)
  • [34] KARI and NASA JSC Collaborative Endeavors for Joint Korea Pathfinder Lunar Orbiter Flight Dynamics Operations: Architecture, Challenges, Successes, and Lessons Learned
    Song, Young-Joo
    Bae, Jonghee
    Hong, SeungBum
    Bang, Jun
    Pohlkamp, Kara M.
    Fuller, Shane
    AEROSPACE, 2023, 10 (08)
  • [35] Technical Solution Discussion for Key Challenges of Operational Convolutional Neural Network-Based Building-Damage Assessment from Satellite Imagery: Perspective from Benchmark xBD Dataset
    Su, Jinhua
    Bai, Yanbing
    Wang, Xingrui
    Lu, Dong
    Zhao, Bo
    Yang, Hanfang
    Mas, Erick
    Koshimura, Shunichi
    REMOTE SENSING, 2020, 12 (22) : 1 - 25
  • [36] ViOCRVQA: novel benchmark dataset and VisionReader for visual question answering by understanding Vietnamese text in images
    Huy Quang Pham
    Thang Kien-Bao Nguyen
    Quan Van Nguyen
    Dan Quang Tran
    Nghia Hieu Nguyen
    Kiet Van Nguyen
    Ngan Luu-Thuy Nguyen
    Multimedia Systems, 2025, 31 (2)