Dual-correlate optimized coarse-fine strategy for monocular laparoscopic videos feature matching via multilevel sequential coupling feature descriptor

被引:0
作者
Zhang, Ziang [1 ]
Song, Hong [2 ]
Fan, Jingfan [3 ]
Fu, Tianyu [1 ]
Li, Qiang [2 ]
Ai, Danni [3 ]
Xiao, Deqaing [3 ]
Yang, Jian [3 ]
机构
[1] Beijing Inst Technol, Sch Med Technol, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci Technol, Beijing 100081, Peoples R China
[3] Beijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Monocular laparoscopic videos; Feature description; Feature matching; Vision transformer; Dual-correlate optimization; Sequential coupling;
D O I
10.1016/j.compbiomed.2023.107890
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Feature matching of monocular laparoscopic videos is crucial for visualization enhancement in computer-assisted surgery, and the keys to conducting high-quality matches are accurate homography estimation, relative pose estimation, as well as sufficient matches and fast calculation. However, limited by various monocular laparoscopic imaging characteristics such as highlight noises, motion blur, texture interference and illumination variation, most exiting feature matching methods face the challenges of producing high-quality matches efficiently and sufficiently. To overcome these limitations, this paper presents a novel sequential coupling feature descriptor to extract and express multilevel feature maps efficiently, and a dual-correlate optimized coarse-fine strategy to establish dense matches in coarse level and adjust pixel-wise matches in fine level. Firstly, a novel sequential coupling swin transformer layer is designed in feature descriptor to learn and extract multilevel feature representations richly without increasing complexity. Then, a dual-correlate optimized coarse-fine strategy is proposed to match coarse feature sequences under low resolution, and the correlated fine feature sequences is optimized to refine pixel-wise matches based on coarse matching priors. Finally, the sequential coupling feature descriptor and dual-correlate optimization are merged into the Sequential Coupling DualCorrelate Network (SeCo DC-Net) to produce high-quality matches. The evaluation is conducted on two public laparoscopic datasets: Scared and EndoSLAM, and the experimental results show the proposed network outperforms state-of-the-art methods in homography estimation, relative pose estimation, reprojection error, matching pairs number and inference runtime. The source code is publicly available at https://github.com/Iheck zza/FeatureMatching.
引用
收藏
页数:22
相关论文
共 54 条
  • [41] LoFTR: Detector-Free Local Feature Matching with Transformers
    Sun, Jiaming
    Shen, Zehong
    Wang, Yuang
    Bao, Hujun
    Zhou, Xiaowei
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8918 - 8927
  • [42] Sun WX, 2022, Arxiv, DOI arXiv:2206.10552
  • [43] Tyszkiewicz Michal, 2020, ADV NEURAL INF PROCE, V33, P14254
  • [44] Vaswani A, 2017, ADV NEUR IN, V30
  • [45] Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
    Wang, Wenhai
    Xie, Enze
    Li, Xiang
    Fan, Deng-Ping
    Song, Kaitao
    Liang, Ding
    Lu, Tong
    Luo, Ping
    Shao, Ling
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 548 - 558
  • [46] Non-local Neural Networks
    Wang, Xiaolong
    Girshick, Ross
    Gupta, Abhinav
    He, Kaiming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7794 - 7803
  • [47] Robust Feature Matching for Remote Sensing Image Registration via Guided Hyperplane Fitting
    Xiao, Guobao
    Luo, Huan
    Zeng, Kun
    Wei, Leyi
    Ma, Jiayi
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [48] Deterministic Model Fitting by Local-Neighbor Preservation and Global-Residual Optimization
    Xiao, Guobao
    Ma, Jiayi
    Wang, Shiping
    Chen, Changwen
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 8988 - 9001
  • [49] Self-Supervised Monocular Depth Estimation With 3-D Displacement Module for Laparoscopic Images
    Xu, Chi
    Huang, Baoru
    Elson, Daniel S.
    [J]. IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2022, 4 (02): : 331 - 334
  • [50] Learning feature descriptors for pre- and intra-operative point cloud matching for laparoscopic liver registration
    Yang, Zixin
    Simon, Richard
    Linte, Cristian A. A.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (06) : 1025 - 1032