Deep learning-based bridge damage cause estimation from multiple images using visual question answering

被引:6
作者
Yamane, Tatsuro [1 ]
Chun, Pang-jo [2 ]
Dang, Ji [3 ]
Okatani, Takayuki [4 ,5 ]
机构
[1] Natl Inst Technol, Tokuyama Coll, Dept Civil Engn & Architecture, Yamaguchi, Japan
[2] Univ Tokyo, Dept Civil Engn, Tokyo, Japan
[3] Saitama Univ, Dept Civil & Environm Engn, Saitama, Japan
[4] Tohoku Univ, Dept Syst Informat Sci, Sendai, Miyagi, Japan
[5] RIKEN, Ctr AIP, Tokyo, Japan
关键词
Deep learning; bridges; damages; maintenance; structure from motion; visual question answering; CRACK DETECTION; NEURAL-NETWORK; PREDICTION; MODEL; INSPECTION;
D O I
10.1080/15732479.2024.2355929
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This paper presents a framework for estimating the cause of damage to bridge members by combining Structure from Motion (SfM) and Visual Question Answering (VQA) techniques. A VQA model was developed that uses bridge images for dataset creation and outputs the damage or member name and its existence based on the images and questions. In the developed model, the correct answer rate for questions requiring the member or damage name were 67.4 and 68.9%, respectively. The correct answer rate for questions requiring a yes/no answer was 99.1%. Based on the developed model, a damage cause estimation method was proposed. In the proposed method, the damage causes are narrowed down by inputting new questions to the VQA model, which are determined based on the surrounding images obtained via SfM and the results of the VQA model. Subsequently, the proposed method was then applied to an actual bridge and shown to be capable of determining damage and estimating its cause. The proposed method could be used to prevent damage causes from being overlooked, and practitioners could determine inspection focus areas, which could contribute to the improvement of maintenance techniques. In the future, it is expected to contribute to infrastructure diagnosis automation.
引用
收藏
页数:14
相关论文
共 69 条
  • [1] Comparison of Visual Inspection and Structural-Health Monitoring As Bridge Condition Assessment Methods
    Agdas, Duzgun
    Rice, Jennifer A.
    Martinez, Justin R.
    Lasa, Ivan R.
    [J]. JOURNAL OF PERFORMANCE OF CONSTRUCTED FACILITIES, 2016, 30 (03)
  • [2] Amezquita-Sanchez JP, 2015, SCI IRAN, V22, P1931
  • [3] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
    Anderson, Peter
    He, Xiaodong
    Buehler, Chris
    Teney, Damien
    Johnson, Mark
    Gould, Stephen
    Zhang, Lei
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6077 - 6086
  • [4] [Anonymous], 2021, ASCE's 2021 infrastructure report card
  • [5] VQA: Visual Question Answering
    Antol, Stanislaw
    Agrawal, Aishwarya
    Lu, Jiasen
    Mitchell, Margaret
    Batra, Dhruv
    Zitnick, C. Lawrence
    Parikh, Devi
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
  • [6] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [7] Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks
    Cha, Young-Jin
    Choi, Wooram
    Buyukozturk, Oral
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2017, 32 (05) : 361 - 378
  • [8] Cho KYHY, 2014, Arxiv, DOI arXiv:1406.1078
  • [9] A deep learning-based image captioning method to automatically generate comprehensive explanations of bridge damage
    Chun, Pang-Jo
    Yamane, Tatsuro
    Maemura, Yu
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2022, 37 (11) : 1387 - 1401
  • [10] Automatic detection method of cracks from concrete surface imagery using two-step light gradient boosting machine
    Chun, Pang-jo
    Izumi, Shota
    Yamane, Tatsuro
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2021, 36 (01) : 61 - 72