DefSLAM: Tracking and Mapping of Deforming Scenes From Monocular Sequences

被引:57
作者
Lamarca, Jose [1 ]
Parashar, Shaifali [2 ]
Bartoli, Adrien [2 ]
Montiel, J. M. M. [1 ]
机构
[1] Univ Zaragoza, Inst Invest Ingn Aragon I3A, Zaragoza 50009, Spain
[2] UCA, CHU, CNRS, Inst Pascal,UMR 6602, F-63000 Clermont Ferrand, France
基金
欧盟地平线“2020”;
关键词
Strain; Simultaneous localization and mapping; Cameras; Shape; Biomedical imaging; Instruction sets; Visualization; Deformable simultaneous localization and mapping; minimal invasive surgery; real-time systems; robustness; simultaneous localization and mapping; SLAM; strain; surgery; surgical vision; three-dimensional displays;
D O I
10.1109/TRO.2020.3020739
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Monocular simultaneous localization and mapping (SLAM) algorithms perform robustly when observing rigid scenes; however, they fail when the observed scene deforms, for example, in medical endoscopy applications. In this article, we present DefSLAM, the first monocular SLAM capable of operating in deforming scenes in real time. Our approach intertwines Shape-from-Template (SfT) and Non-Rigid Structure-from-Motion (NRSfM) techniques to deal with the exploratory sequences typical of SLAM. A deformation tracking thread recovers the pose of the camera and the deformation of the observed map, at frame rate, by means of SfT processing a template that models the scene shape-at-rest. A deformation mapping thread runs in parallel with the tracking to update the template, at keyframe rate, by means of an isometric NRSfM processing a batch of full perspective keyframes. In our experiments, DefSLAM processes close-up sequences of deforming scenes, both in a laboratory-controlled experiment and in medical endoscopy sequences, producing accurate 3-D models of the scene with respect to the moving camera.
引用
收藏
页码:291 / 303
页数:13
相关论文
共 43 条
[1]  
Agarwal Sameer., 2010, Ceres Solver
[2]  
Agudo A, 2015, PROC CVPR IEEE, P2179, DOI 10.1109/CVPR.2015.7298830
[3]   Good Vibrations: A Modal Analysis Approach for Sequential Non-Rigid Structure from Motion [J].
Agudo, Antonio ;
Agapito, Lourdes ;
Calvo, Begona ;
Montiel, J. M. M. .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1558-1565
[4]   Trajectory Space: A Dual Representation for Nonrigid Structure from Motion [J].
Akhter, Ijaz ;
Sheikh, Yaser ;
Khan, Sohaib ;
Kanade, Takeo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (07) :1442-1456
[5]   Shape-from-Template [J].
Bartoli, Adrien ;
Gerard, Yan ;
Chadebecq, Francois ;
Collins, Toby ;
Pizarro, Daniel .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (10) :2099-2118
[6]  
Bradski G, 2000, DR DOBBS J, V25, P120
[7]  
Bregler C, 2000, PROC CVPR IEEE, P690, DOI 10.1109/CVPR.2000.854941
[8]  
Chhatkuli A., 2014, P BRITISHMACH VIS C
[9]   A Stable Analytical Framework for Isometric Shape-from-Template by Surface Integration [J].
Chhatkuli, Ajad ;
Pizarro, Daniel ;
Bartoli, Adrien ;
Collins, Toby .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (05) :833-850
[10]   Inextensible Non-Rigid Shape-from-Motion by Second-Order Cone Programming [J].
Chhatkuli, Ajad ;
Pizarro, Daniel ;
Collins, Toby ;
Bartoli, Adrien .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1719-1727