On the Monocular 3-D Pose Estimation for Arbitrary Shaped Needle in Dynamic Scenes: An Efficient Visual Learning and Geometry Modeling Approach

被引:1
作者
Li, Bin [1 ]
Lu, Bo [2 ]
Lin, Hongbin [1 ]
Wang, Yaxiang [1 ]
Zhong, Fangxun [1 ]
Dou, Qi [3 ,4 ]
Liu, Yun-Hui [1 ]
机构
[1] Chinese Univ Hong Kong, T Stone Robot Inst, Dept Mech & Automat Engn, Hong Kong, Peoples R China
[2] Soochow Univ, Robot & Microsyst Ctr, Sch Mech & Elect Engn, Suzhou 215131, Jiangsu, Peoples R China
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, T Stone Robot Inst, Hong Kong, Peoples R China
来源
IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS | 2024年 / 6卷 / 02期
关键词
Needles; Pose estimation; Solid modeling; Feature extraction; Surgery; Shape; Visualization; Surgical Robotics; Pose Estimation; Geometry Modeling; Vision-based Manipulation;
D O I
10.1109/TMRB.2024.3377357
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Image-guided needle pose estimation is crucial for robotic autonomous suturing, but it poses significant challenges due to the needle's slender visual projection and dynamic surgical environments. Current state-of-the-art methods rely on additional prior information (e.g., in-hand grasp, accurate kinematics, etc.) to achieve sub-millimeter accuracy, hindering their applicability in varying surgical scenes. This paper presents a new generic framework for monocular needle pose estimation: Visual learning network for efficient geometric primitives extraction and novel geometry model for accurate pose recovery. To capture needle's primitives precisely, we introduce a morphology-based mask contour fusion mechanism in a multi-scale manner. We then establish a novel state representation for needle pose and develop a physical projection model to derive its relationship with the primitives. An anti-occlusion objective is formulated to jointly optimize the pose and bias of inference primitives, achieving sub-millimeter accuracy under occlusion scenarios. Our approach requires neither CAD model nor circular shape assumption and can extensively estimate poses of other small planar axisymmetric objects. Experiments on ex-/in-vivo scenarios validate the accuracy of estimated intermediate primitives and final poses of needles. We further deploy our framework on the dVRK platform for automatic and precise needle manipulations, demonstrating the feasibility for use in robotic surgery.
引用
收藏
页码:460 / 474
页数:15
相关论文
共 40 条
[1]   A Dataset and Benchmarks for Segmentation and Recognition of Gestures in Robotic Surgery [J].
Ahmidi, Narges ;
Tao, Lingling ;
Sefati, Shahin ;
Gao, Yixin ;
Lea, Colin ;
Haro, Benjamin Bejar ;
Zappella, Luca ;
Khudanpur, Sanjeev ;
Vidal, Rene ;
Hager, Gregory D. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2017, 64 (09) :2025-2041
[2]   Detecting Surgical Tools by Modelling Local Appearance and Global Shape [J].
Bouget, David ;
Benenson, Rodrigo ;
Omran, Mohamed ;
Riffaud, Laurent ;
Schiele, Bernt ;
Jannin, Pierre .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2015, 34 (12) :2603-2617
[3]  
Bradbury J., 2018, JAX COMPOSABLE TRANS
[4]   Deep learning robotic guidance for autonomous vascular access [J].
Chen, Alvin I. ;
Balter, Max L. ;
Maguire, Timothy J. ;
Yarmush, Martin L. .
NATURE MACHINE INTELLIGENCE, 2020, 2 (02) :104-+
[5]   SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose Estimation [J].
Chen, Kai ;
Dou, Qi .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :2753-2762
[6]   Real-Time Constrained 6D Object-Pose Tracking of An In-Hand Suture Needle for Minimally Invasive Robotic Surgery [J].
Chiu, Zih-Yun ;
Richter, Florian ;
Yip, Michael C. .
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, :4761-4767
[7]  
Chiu Zih-Yun., 2021, arXiv
[8]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[9]  
D'Ettorre C, 2018, IEEE INT CONF ROBOT, P1370, DOI 10.1109/ICRA.2018.8461200
[10]   Articulated Multi-Instrument 2-D Pose Estimation Using Fully Convolutional Networks [J].
Du, Xiaofei ;
Kurmann, Thomas ;
Chang, Ping-Lin ;
Allan, Maximilian ;
Ourselin, Sebastien ;
Sznitman, Raphael ;
Kelly, John D. ;
Stoyanov, Danail .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (05) :1276-1287