Keyframe extraction from laparoscopic videos based on visual saliency detection

被引：26

作者：

Loukas, Constantinos ^{[1
]}

Varytimidis, Christos ^{[2
]}

Rapantzikos, Konstantinos ^{[2
]}

Kanakis, Meletios A. ^{[3
]}

机构：

[1] Univ Athens, Med Sch, Lab Med Phys, Mikras Asias 75 Str, Athens 11527, Greece

[2] Natl & Tech Univ Athens, Sch Elect & Comp Engn, Athens, Greece

[3] Great Ormond St Hosp Sick Children, Cardiothorac Surg Unit, London, England

来源：

COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE | 2018年 / 165卷

关键词：

Video analysis; Keyframe extraction; Hidden Markov multivariate autoregressive models; Visual saliency; ATTENTION DRIVEN FRAMEWORK; ENDOSCOPIC SURGERY VIDEOS; KEY-FRAMES; CLASSIFICATION; MODELS;

D O I：

10.1016/j.cmpb.2018.07.004

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Background and objective: Laparoscopic surgery offers the potential for video recording of the operation, which is important for technique evaluation, cognitive training, patient briefing and documentation. An effective way for video content representation is to extract a limited number of keyframes with semantic information. In this paper we present a novel method for keyframe extraction from individual shots of the operational video. Methods: The laparoscopic video was first segmented into video shots using an objectness model, which was trained to capture significant changes in the endoscope field of view. Each frame of a shot was then decomposed into three saliency maps in order to model the preference of human vision to regions with higher differentiation with respect to color, motion and texture. The accumulated responses from each map provided a 3D time series of saliency variation across the shot. The time series was modeled as a multivariate autoregressive process with hidden Markov states (HMMAR model). This approach allowed the temporal segmentation of the shot into a predefined number of states. A representative keyframe was extracted from each state based on the highest state-conditional probability of the corresponding saliency vector. Results: Our method was tested on 168 video shots extracted from various laparoscopic cholecystectomy operations from the publicly available Cholec80 dataset. Four state-of-the-art methodologies were used for comparison. The evaluation was based on two assessment metrics: Color Consistency Score (CCS), which measures the color distance between the ground truth (GT) and the closest keyframe, and Temporal Consistency Score (TCS), which considers the temporal proximity between GT and extracted keyframes. About 81% of the extracted keyframes matched the color content of the GT keyframes, compared to 77% yielded by the second-best method. The TCS of the proposed and the second-best method was close to 1.9 and 1.4 respectively. Conclusions: Our results demonstrated that the proposed method yields superior performance in terms of content and temporal consistency to the ground truth. The extracted keyframes provided highly semantic information that may be used for various applications related to surgical video content representation, such as workflow analysis, video summarization and retrieval. (C) 2018 Elsevier B.V. All rights reserved.

引用

页码：13 / 23

页数：11

共 50 条

[21] Visual saliency detection based on visual center shift
Hu, Jinge
Xiong, Jiang
Feng, Yuming
Onasanya, B. O.
2021 13TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2021, : 227 - 232
[22] 3D Keyframe Motion Extraction from Zapin Traditional Dance Videos
Albakri, Ikmal Faiq
Wafiy, Nik
Suaib, Norhaida Mohd
Rahim, Mohd Shafry Mohd
Yu, Hongchuan
COMPUTATIONAL SCIENCE AND TECHNOLOGY (ICCST 2019), 2020, 603 : 65 - 74
[23] Visual Saliency Detection Using Group Lasso Regularization in Videos of Natural Scenes
Souly, Nasim
Shah, Mubarak
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 117 (01) : 93 - 110
[24] Visual Saliency Detection Using Group Lasso Regularization in Videos of Natural Scenes
Nasim Souly
Mubarak Shah
International Journal of Computer Vision, 2016, 117 : 93 - 110
[25] A Novel Image Segmentation Algorithm based on Visual Saliency Detection and Integrated Feature Extraction
Liu, Weiting
Qing, Xue
Zhou, Jian
PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES), 2016, : 966 - 970
[26] Saliency Prediction in Uncategorized Videos Based on Audio-Visual Correlation
Qamar, Maryam
Qamar, Suleman
Muneeb, Muhammad
Bae, Sung-Ho
Rahman, Anis
IEEE ACCESS, 2023, 11 : 15460 - 15470
[27] RELATIONAL ENTROPY-BASED SALIENCY DETECTION IN IMAGES AND VIDEOS
Duncan, Kester
Sarkar, Sudeep
2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1093 - 1096
[28] CNN-based temporal detection of motion saliency in videos
Maczyta, Leo
Bouthemy, Patrick
Le Meur, Olivier
PATTERN RECOGNITION LETTERS, 2019, 128 : 298 - 305
[29] INFLUENCE OF COLOR ON VISUAL SALIENCY IN SHORT VIDEOS
Ciortan, Irina M.
Dinet, Eric
Tremeau, Alain
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1160 - 1164
[30] MULTI-KEYFRAME ABSTRACTION FROM VIDEOS
Li, Ping
Guo, Yanwen
Sun, Hanqiu
2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,

← 1 2 3 4 5 →