BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo Surgical Image Matching

被引:10
作者
Song, Jingwei [1 ,2 ]
Zhu, Qiuchen [3 ]
Lin, Jianyu [4 ]
Ghaffari, Maani [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] United Imaging Res Inst Intelligent Imaging, Beijing 100144, Peoples R China
[3] Univ Technol Sydney, Sch Elect & Data Engn, Broadway, NSW 2007, Australia
[4] Imperial Coll London, Hamlyn Ctr Robot Surg, London SW7 2AZ, England
基金
美国国家科学基金会;
关键词
Real-time systems; Estimation; Task analysis; Surgery; Training; Industries; Graphics processing units; Bayesian theory; posterior probability inference; stereo matching; INVASIVE SURGERY; RECONSTRUCTION;
D O I
10.1109/TRO.2022.3215018
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In stereoscope-based Minimally Invasive Surgeries (MIS), dense stereo matching plays an indispensable role in 3D shape recovery, AR, VR, and navigation tasks. Although numerous Deep Neural Network (DNN) approaches are proposed, the conventional prior-free approaches are still popular in the industry because of the lack of open-source annotated data set and the limitation of the task-specific pre-trained DNNs. Among the prior-free stereo matching algorithms, there is no successful real-time algorithm in none GPU environment for MIS. This paper proposes the first CPU-level real-time prior-free stereo matching algorithm for general MIS tasks. We achieve an average 17 Hz on 640*480 images with a single-core CPU (i5-9400) for surgical images. Meanwhile, it achieves slightly better accuracy than the popular ELAS. The patch-based fast disparity searching algorithm is adopted for the rectified stereo images. A coarse-to-fine Bayesian probability and a spatial Gaussian mixed model were proposed to evaluate the patch probability at different scales. An optional probability density function estimation algorithm was adopted to quantify the prediction variance. Extensive experiments demonstrated the proposed method's capability to handle ambiguities introduced by the textureless surfaces and the photometric inconsistency from the non-Lambertian reflectance and dark illumination. The estimated probability managed to balance the confidences of the patches for stereo images at different scales. It has similar or higher accuracy and fewer outliers than the baseline ELAS in MIS, while it is 4-5 times faster. The code and the synthetic data sets are available at https://github.com/JingweiSong/BDIS-v2.
引用
收藏
页码:1388 / 1406
页数:19
相关论文
共 65 条
[31]   E-DSSR: Efficient Dynamic Surgical Scene Reconstruction with Transformer-Based Stereoscopic Depth Perception [J].
Long, Yonghao ;
Li, Zhaoshuo ;
Yee, Chi Hang ;
Ng, Chi Fai ;
Taylor, Russell H. ;
Unberath, Mathias ;
Dou, Qi .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT IV, 2021, 12904 :415-425
[32]  
Lucas B. D., 1981, 7th international joint conference on Artificial intelligence, V2, P674
[33]   Polyp segmentation and classification using predicted depth from monocular endoscopy [J].
Mahmood, Faisal ;
Yang, Ziyun ;
Chen, Richard ;
Borders, Daniel ;
Xu, Wenhao ;
Durr, Nicholas J. .
MEDICAL IMAGING 2019: COMPUTER-AIDED DIAGNOSIS, 2019, 10950
[34]   Object Scene Flow [J].
Menze, Moritz ;
Heipke, Christian ;
Geiger, Andreas .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 :60-76
[35]  
Min DB, 2011, IEEE I CONF COMP VIS, P1567, DOI 10.1109/ICCV.2011.6126416
[36]   Dynamic View Expansion for Minimally Invasive Surgery using Simultaneous Localization And Mapping [J].
Mountney, Peter ;
Yang, Guang-Zhong .
2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, :1184-1187
[37]  
Paszke A, 2019, ADV NEUR IN, V32
[38]  
Pratt P, 2014, LECT NOTES COMPUT SC, V8674, P667, DOI 10.1007/978-3-319-10470-6_83
[39]  
Quattoni A., 2004, Advances in Neural Information Processing Systems (NIPS), P1097
[40]   Hidden conditional random fields [J].
Quattoni, Ariadna ;
Wang, Sybor ;
Morency, Louis-Philippe ;
Collins, Michael ;
Darrell, Trevor .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (10) :1848-1853