TSC-DL: Unsupervised Trajectory Segmentation of Multi-Modal Surgical Demonstrations with Deep Learning

被引：0

作者：

Murali, Adithyavairavan ^{[1
]}

Garg, Animesh ^{[1
]}

Krishnan, Sanjay ^{[1
]}

Pokorny, Florian T. ^{[1
]}

Abbeel, Pieter ^{[1
]}

Darrell, Trevor ^{[1
]}

Goldberg, Ken ^{[1
]}

机构：

[1] Univ Calif, EECS & IEOR, Berkeley, CA USA

来源：

2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2016年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The growth of robot-assisted minimally invasive surgery has led to sizable datasets of fixed-camera video and kinematic recordings of surgical subtasks. Segmentation of these trajectories into locally-similar contiguous sections can facilitate learning from demonstrations, skill assessment, and salvaging good segments from otherwise inconsistent demonstrations. Manual, or supervised, segmentation can be prone to error and impractical for large datasets. We present Transition State Clustering with Deep Learning (TSC-DL), a new unsupervised algorithm that leverages video and kinematic data for task-level segmentation, and finds regions of the visual feature space that correlate with transition events using features constructed from layers of pre-trained image classification Deep Convolutional Neural Networks (CNNs). We report results on three datasets comparing Deep Learning architectures (AlexNet and VGG), choice of convolutional layer, dimensionality reduction techniques, visual encoding, and the use of Scale Invariant Feature Transforms (SIFT). We find that the deep architectures extract features that result in up-to a 30.4% improvement in Silhouette Score (a measure of cluster tightness) over the traditional "shallow" features from SIFT. We also present cases where TSC-DL discovers human annotator omissions. Supplementary material, data and code is available at: http://berkeleyautomation.github.io/tsc-dl/

引用

页码：4150 / 4157

页数：8

共 50 条

[1] Unsupervised Trajectory Segmentation and Promoting of Multi-Modal Surgical Demonstrations
Shao, Zhenzhou
Zhao, Hongfa
Xie, Jiexin
Qu, Ying
Guan, Yong
Tan, Jindong
2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 777 - 782
[2] Unsupervised Multi-modal Learning
Iqbal, Mohammed Shameer
ADVANCES IN ARTIFICIAL INTELLIGENCE (AI 2015), 2015, 9091 : 343 - 346
[3] A Fast Approach for Multi-Modality Surgical Trajectory Segmentation with Unsupervised Deep Learning
Xie J.
Zhao H.
Shao Z.
Shi Z.
Guan Y.
Jiqiren/Robot, 2019, 41 (03): : 317 - 326and333
[4] Deep Collaborative Multi-Modal Learning for Unsupervised Kinship Estimation
Dong, Guan-Nan
Pun, Chi-Man
Zhang, Zheng
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 4197 - 4210
[5] Multi-Modal Object Tracking and Image Fusion With Unsupervised Deep Learning
LaHaye, Nicholas
Ott, Jordan
Garay, Michael J.
El-Askary, Hesham Mohamed
Linstead, Erik
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (08) : 3056 - 3066
[6] Deep Robust Unsupervised Multi-Modal Network
Yang, Yang
Wu, Yi-Feng
Zhan, De-Chuan
Liu, Zhi-Bin
Jiang, Yuan
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5652 - 5659
[7] A framework for unsupervised segmentation of multi-modal medical images
El-Baz, Ayman
Farag, Aly
Ali, Asem
Gimel'farb, Georgy
Casanova, Manuel
COMPUTER VISION APPROACHES TO MEDICAL IMAGE ANALYSIS, 2006, 4241 : 120 - 131
[8] Burn-In Demonstrations for Multi-Modal Imitation Learning
Kuefler, Alex
Kochenderfer, Mykel J.
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1071 - 1078
[9] A Multi-Modal Learning System for On-Line Surgical Action Segmentation
De Rossi, Giacomo
Roin, Serena
Setti, Francesco
Muradore, Riccardo
2020 INTERNATIONAL SYMPOSIUM ON MEDICAL ROBOTICS (ISMR), 2020, : 132 - 138
[10] Multi-modal body part segmentation of infants using deep learning
Voss, Florian
Brechmann, Noah
Lyra, Simon
Rixen, Joeran
Leonhardt, Steffen
Antink, Christoph Hoog
BIOMEDICAL ENGINEERING ONLINE, 2023, 22 (01)

← 1 2 3 4 5 →