Efficient matching of Transformer-enhanced features for accurate vision-based displacement measurement

被引:0
作者
Zhang, Haoyu [1 ,2 ]
Wu, Stephen [3 ,4 ]
Luo, Xiangyun [1 ,2 ]
Huang, Yong [1 ,2 ]
Li, Hui [1 ,2 ]
机构
[1] Harbin Inst Technol, Minist Ind & Informat Technol, Key Lab Smart Prevent & Mitigat Civil Engn Disaste, Harbin 150090, Peoples R China
[2] Harbin Inst Technol, Sch Civil Engn, Key Lab Struct Dynam Behav & Control, Minist Educ, Harbin 150090, Peoples R China
[3] Inst Stat Math, Res Org Informat & Syst, 10-3 Midori Cho, Tachikawa, Tokyo 1908562, Japan
[4] Grad Univ Adv Studies SOKENDAI, 10-3 Midori Cho, Tachikawa, Tokyo 1908562, Japan
基金
中国国家自然科学基金;
关键词
Attention mechanism; Subpixel feature matching; Transformer; Computer vision; Displacement measurement; Vibration monitoring; LEARNING OPTICAL-FLOW; DYNAMIC DISPLACEMENT; CIVIL INFRASTRUCTURE; COMPUTER VISION; ACCELERATION; PERFORMANCE; VELOCITY; FUSION; BRIDGE;
D O I
10.1016/j.autcon.2025.105962
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Computer vision technology and monitoring videos have been employed to obtain structural displacement measurements. Noniterative algorithms are mainly designed for rapid tracking of the motions of individual image points, rather than dense motion fields. Iterative algorithms are limited to estimating motion fields with small amplitudes and require high computation cost to achieve high accuracy. This paper introduces a non- iterative method for vision-based measurements that balances speed and density. The method employs an attention-based matching strategy applied to Transformer-enhanced image features. Motion priors and a physics- informed denoising approach are integrated to improve measurement accuracy. Tested on challenging truss and cable-stayed bridge vibration videos, the method demonstrated superior displacement measurement performance compared to conventional approaches. It also achieved greater robustness to brightness changes and partial occlusions while requiring minimal human intervention. This method supports the development of automated and affordable vibration monitoring systems.
引用
收藏
页数:18
相关论文
共 68 条
[51]   Advances in Computer Vision-Based Civil Infrastructure Inspection and Monitoring [J].
Spencer, Billie F., Jr. ;
Hoskere, Vedhus ;
Narazaki, Yasutaka .
ENGINEERING, 2019, 5 (02) :199-222
[52]   MEASUREMENTS OF STATIC AND DYNAMIC DISPLACEMENT FROM VISUAL MONITORING OF THE HUMBER BRIDGE [J].
STEPHEN, GA ;
BROWNJOHN, JMW ;
TAYLOR, CA .
ENGINEERING STRUCTURES, 1993, 15 (03) :197-208
[53]   PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume [J].
Sun, Deqing ;
Yang, Xiaodong ;
Liu, Ming-Yu ;
Kautz, Jan .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8934-8943
[54]   LoFTR: Detector-Free Local Feature Matching with Transformers [J].
Sun, Jiaming ;
Shen, Zehong ;
Wang, Yuang ;
Bao, Hujun ;
Zhou, Xiaowei .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8918-8927
[55]  
Teed Zachary, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12347), P402, DOI 10.1007/978-3-030-58536-5_24
[56]   SHAPE AND MOTION FROM IMAGE STREAMS UNDER ORTHOGRAPHY - A FACTORIZATION METHOD [J].
TOMASI, C ;
KANADE, T .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1992, 9 (02) :137-154
[57]  
Vaswani A, 2017, ADV NEUR IN, V30
[58]   DeepFlow: Large displacement optical flow with deep matching [J].
Weinzaepfel, Philippe ;
Revaud, Jerome ;
Harchaoui, Zaid ;
Schmid, Cordelia .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1385-1392
[59]   Local positioning systems versus structural monitoring: a review [J].
Wu, Lijun ;
Casciati, Fabio .
STRUCTURAL CONTROL & HEALTH MONITORING, 2014, 21 (09) :1209-1221
[60]   Unifying Flow, Stereo and Depth Estimation [J].
Xu, Haofei ;
Zhang, Jing ;
Cai, Jianfei ;
Rezatofighi, Hamid ;
Yu, Fisher ;
Tao, Dacheng ;
Geiger, Andreas .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) :13941-13958