BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo Surgical Image Matching

被引:10
作者
Song, Jingwei [1 ,2 ]
Zhu, Qiuchen [3 ]
Lin, Jianyu [4 ]
Ghaffari, Maani [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] United Imaging Res Inst Intelligent Imaging, Beijing 100144, Peoples R China
[3] Univ Technol Sydney, Sch Elect & Data Engn, Broadway, NSW 2007, Australia
[4] Imperial Coll London, Hamlyn Ctr Robot Surg, London SW7 2AZ, England
基金
美国国家科学基金会;
关键词
Real-time systems; Estimation; Task analysis; Surgery; Training; Industries; Graphics processing units; Bayesian theory; posterior probability inference; stereo matching; INVASIVE SURGERY; RECONSTRUCTION;
D O I
10.1109/TRO.2022.3215018
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In stereoscope-based Minimally Invasive Surgeries (MIS), dense stereo matching plays an indispensable role in 3D shape recovery, AR, VR, and navigation tasks. Although numerous Deep Neural Network (DNN) approaches are proposed, the conventional prior-free approaches are still popular in the industry because of the lack of open-source annotated data set and the limitation of the task-specific pre-trained DNNs. Among the prior-free stereo matching algorithms, there is no successful real-time algorithm in none GPU environment for MIS. This paper proposes the first CPU-level real-time prior-free stereo matching algorithm for general MIS tasks. We achieve an average 17 Hz on 640*480 images with a single-core CPU (i5-9400) for surgical images. Meanwhile, it achieves slightly better accuracy than the popular ELAS. The patch-based fast disparity searching algorithm is adopted for the rectified stereo images. A coarse-to-fine Bayesian probability and a spatial Gaussian mixed model were proposed to evaluate the patch probability at different scales. An optional probability density function estimation algorithm was adopted to quantify the prediction variance. Extensive experiments demonstrated the proposed method's capability to handle ambiguities introduced by the textureless surfaces and the photometric inconsistency from the non-Lambertian reflectance and dark illumination. The estimated probability managed to balance the confidences of the patches for stereo images at different scales. It has similar or higher accuracy and fewer outliers than the baseline ELAS in MIS, while it is 4-5 times faster. The code and the synthetic data sets are available at https://github.com/JingweiSong/BDIS-v2.
引用
收藏
页码:1388 / 1406
页数:19
相关论文
共 65 条
[1]  
Allan M, 2021, ARXIV
[2]  
Andrew A. M., 2001, MULTIPLE VIEW GEOMET, V30, P1333, DOI [10.1108/k.2001.30.9_10.1333.2, DOI 10.1108/K.2001.30.9_10.1333.2]
[3]  
[Anonymous], 2008, P 25 INT C MACHINE L
[4]  
Atapour-Abarghouei Amir, 2018, P IEEE C COMPUTER VI, P2800, DOI DOI 10.1109/CVPR.2018.00296
[5]   Lucas-Kanade 20 years on: A unifying framework [J].
Baker, S ;
Matthews, I .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 56 (03) :221-255
[6]  
Bishop C. M., 2006, PATTERN RECOGN
[7]   HAPNet: hierarchically aggregated pyramid network for real-time stereo matching [J].
Brandao, Patrick ;
Psychogyios, Dimitris ;
Mazomenos, Evangelos ;
Stoyanov, Danail ;
Janatka, Mirek .
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2021, 9 (03) :219-224
[8]   VisionBlender: a tool to efficiently generate computer vision datasets for robotic surgery [J].
Cartucho, Joao ;
Tukra, Samyakh ;
Li, Yunpeng ;
Elson, Daniel S. ;
Giannarou, Stamatia .
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2021, 9 (04) :331-338
[9]   Pyramid Stereo Matching Network [J].
Chang, Jia-Ren ;
Chen, Yong-Sheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5410-5418
[10]   S2R-DepthNet: Learning a Generalizable Depth-specific Structural Representation [J].
Chen, Xiaotian ;
Wang, Yuwang ;
Chen, Xuejin ;
Zeng, Wenjun .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3033-3042