BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo Surgical Image Matching

被引:10
作者
Song, Jingwei [1 ,2 ]
Zhu, Qiuchen [3 ]
Lin, Jianyu [4 ]
Ghaffari, Maani [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] United Imaging Res Inst Intelligent Imaging, Beijing 100144, Peoples R China
[3] Univ Technol Sydney, Sch Elect & Data Engn, Broadway, NSW 2007, Australia
[4] Imperial Coll London, Hamlyn Ctr Robot Surg, London SW7 2AZ, England
基金
美国国家科学基金会;
关键词
Real-time systems; Estimation; Task analysis; Surgery; Training; Industries; Graphics processing units; Bayesian theory; posterior probability inference; stereo matching; INVASIVE SURGERY; RECONSTRUCTION;
D O I
10.1109/TRO.2022.3215018
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In stereoscope-based Minimally Invasive Surgeries (MIS), dense stereo matching plays an indispensable role in 3D shape recovery, AR, VR, and navigation tasks. Although numerous Deep Neural Network (DNN) approaches are proposed, the conventional prior-free approaches are still popular in the industry because of the lack of open-source annotated data set and the limitation of the task-specific pre-trained DNNs. Among the prior-free stereo matching algorithms, there is no successful real-time algorithm in none GPU environment for MIS. This paper proposes the first CPU-level real-time prior-free stereo matching algorithm for general MIS tasks. We achieve an average 17 Hz on 640*480 images with a single-core CPU (i5-9400) for surgical images. Meanwhile, it achieves slightly better accuracy than the popular ELAS. The patch-based fast disparity searching algorithm is adopted for the rectified stereo images. A coarse-to-fine Bayesian probability and a spatial Gaussian mixed model were proposed to evaluate the patch probability at different scales. An optional probability density function estimation algorithm was adopted to quantify the prediction variance. Extensive experiments demonstrated the proposed method's capability to handle ambiguities introduced by the textureless surfaces and the photometric inconsistency from the non-Lambertian reflectance and dark illumination. The estimated probability managed to balance the confidences of the patches for stereo images at different scales. It has similar or higher accuracy and fewer outliers than the baseline ELAS in MIS, while it is 4-5 times faster. The code and the synthetic data sets are available at https://github.com/JingweiSong/BDIS-v2.
引用
收藏
页码:1388 / 1406
页数:19
相关论文
共 65 条
[21]   Accurate and efficient stereo processing by semi-global matching and mutual information [J].
Hirschmüller, H .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :807-814
[22]   Automatic Polyp Recognition in Colonoscopy Images Using Deep Learning and Two-Stage Pyramidal Feature Prediction [J].
Jia, Xiao ;
Mai, Xiaochun ;
Cui, Yi ;
Yuan, Yixuan ;
Xing, Xiaohan ;
Seo, Hyunseok ;
Xing, Lei ;
Meng, Max Q. -H. .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2020, 17 (03) :1570-1584
[23]   Development of the PEO Based Solid Polymer Electrolytes for All-Solid State Lithium Ion Batteries [J].
Jiang, Yu ;
Yan, Xuemin ;
Ma, Zhaofei ;
Mei, Ping ;
Xiao, Wei ;
You, Qinliang ;
Zhang, Yan .
POLYMERS, 2018, 10 (11)
[24]   End-to-End Learning of Geometry and Context for Deep Stereo Regression [J].
Kendall, Alex ;
Martirosyan, Hayk ;
Dasgupta, Saumitro ;
Henry, Peter ;
Kennedy, Ryan ;
Bachrach, Abraham ;
Bry, Adam .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :66-75
[25]  
Kingma DP, 2014, ADV NEUR IN, V27
[26]  
Klaus A, 2006, INT C PATT RECOG, P15
[27]   Fast Optical Flow Using Dense Inverse Search [J].
Kroeger, Till ;
Timofte, Radu ;
Dai, Dengxin ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :471-488
[28]  
Lafferty J.D., 2001, P 18 INT C MACH LEAR, DOI [10.5555/645530.655813, DOI 10.5555/645530.655813]
[29]   Optimizing ZNCC calculation in binocular stereo matching [J].
Lin, Chuan ;
Li, Ya ;
Xu, Guili ;
Cao, Yijun .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 52 :64-73
[30]  
Lin Zhang, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P3587, DOI 10.1109/ICRA.2017.7989412