Real-time recognition of surgical tasks in eye surgery videos

被引:28
作者
Quellec, Gwenole [1 ]
Charriere, Katia [1 ,2 ]
Lamard, Mathieu [1 ,3 ]
Droueche, Zakarya [1 ,2 ]
Roux, Christian [1 ,2 ]
Cochener, Beatrice [1 ,3 ,4 ]
Cazuguel, Guy [1 ,2 ]
机构
[1] INSERM, UMR 1101, F-29200 Brest, France
[2] UEB, TELECOM Bretagne, Inst Mines Telecom, Dpt ITI, F-29200 Brest, France
[3] Univ Bretagne Occidentale, F-29200 Brest, France
[4] CHRU Brest, Serv Ophtalmol, F-29200 Brest, France
关键词
CBVR; Real-time; Surgical task recognition; Eye surgery; IMAGE RETRIEVAL; WORKFLOW; OUTCOMES; SYSTEMS;
D O I
10.1016/j.media.2014.02.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, many surgeries, including eye surgeries, are video-monitored. We present in this paper an automatic video analysis system able to recognize surgical tasks in real-time. The proposed system relies on the Content-Based Video Retrieval (CBVR) paradigm. It characterizes short subsequences in the video stream and searches for video subsequences with similar structures in a video archive. Fixed-length feature vectors are built for each subsequence: the feature vectors are unchanged by variations in duration and temporal structure among the target surgical tasks. Therefore, it is possible to perform fast nearest neighbor searches in the video archive. The retrieved video subsequences are used to recognize the current surgical task by analogy reasoning. The system can be trained to recognize any surgical task using weak annotations only. It was applied to a dataset of 23 epiretinal membrane surgeries and a dataset of 100 cataract surgeries. Three surgical tasks were annotated in the first dataset. Nine surgical tasks were annotated in the second dataset To assess its generality, the system was also applied to a dataset of 1,707 movie clips in which 12 human actions were annotated. High task recognition scores were measured in all three datasets. Real-time task recognition will be used in future works to communicate with surgeons (trainees in particular) or with surgical devices. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:579 / 590
页数:12
相关论文
共 45 条
  • [1] André B, 2010, LECT NOTES COMPUT SC, V6362, P480
  • [2] [Anonymous], 1974, Solving least squares problems
  • [3] ARYA S, 1993, PROCEEDINGS OF THE FOURTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P271
  • [4] Blum T, 2010, LECT NOTES COMPUT SC, V6363, P400
  • [5] Design of multimodal dissimilarity spaces for retrieval of video documents
    Bruno, Eric
    Moenne-Loccoz, Nicolas
    Marchand-Maillet, Stephane
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (09) : 1520 - 1533
  • [6] Cano AM, 2008, LECT NOTES COMPUT SC, V5104, P191, DOI 10.1007/978-3-540-70521-5_21
  • [7] Computer-aided detection of diagnostic and therapeutic operations in colonoscopy videos
    Cao, Yu
    Liu, Danyu
    Tavanapong, Wallapak
    Wong, Johnny
    Oh, JungHwan
    de Groen, Piet C.
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2007, 54 (07) : 1268 - 1279
  • [8] Clinical outcomes and costs of cataract surgery performed by planned ECCE and phacoemulsification
    Castells, X
    Comas, M
    Castilla, M
    Cots, F
    Alarcón, S
    [J]. INTERNATIONAL OPHTHALMOLOGY, 1998, 22 (06) : 363 - 367
  • [9] Visual outcomes after pars plana vitrectomy for epiretinal membranes associated with pars planitis
    Dev, S
    Mieler, WF
    Pulido, JS
    Mittra, RA
    [J]. OPHTHALMOLOGY, 1999, 106 (06) : 1086 - 1090
  • [10] An Image-Based Approach to Video Copy Detection With Spatio-Temporal Post-Filtering
    Douze, Matthijs
    Jegou, Herve
    Schmid, Cordelia
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 12 (04) : 257 - 266