Keyframe Extraction From Laparoscopic Videos via Diverse and Weighted Dictionary Selection

Cited by: 13
Authors
Ma, Mingyang [1]
Mei, Shaohui [1]
Wan, Shuai [1]
Wang, Zhiyong [2]
Ge, Zongyuan [3, 4, 5]
Lam, Vincent [6, 7]
Feng, Dagan [2]
Affiliations
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710129, Shaanxi, Peoples R China
[2] Univ Sydney, Sch Comp Sci, Sydney, NSW 2006, Australia
[3] Monash Univ, eRes, Clayton, Vic 3800, Australia
[4] Monash Univ, Fac Engn, Clayton, Vic 3800, Australia
[5] Airdoc Res, Clayton, Vic 3800, Australia
[6] Westmead Hosp, Dept Gen Surg, Westmead, NSW 2145, Australia
[7] Macquarie Univ, Dept Clin Med, N Ryde, NSW 2113, Australia
Funding
National Natural Science Foundation of China
Keywords
Videos; Laparoscopes; Feature extraction; Dictionaries; Surgery; Visualization; Biomedical imaging; Dictionary selection; keyframe extraction; laparoscopic videos; video summarization; ATTENTION DRIVEN FRAMEWORK
DOI
10.1109/JBHI.2020.3019198
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Laparoscopic videos have been increasingly acquired for various purposes, including surgical training and quality assurance, due to the wide adoption of laparoscopy in minimally invasive surgeries. However, it is very time consuming to view a large number of laparoscopic videos, which prevents the value of laparoscopic video archives from being well exploited. In this paper, a dictionary selection based video summarization method is proposed to effectively extract keyframes for fast access to laparoscopic videos. Firstly, unlike the low-level features used in most existing summarization methods, deep features are extracted from a convolutional neural network to effectively represent video frames. Secondly, based on such a deep representation, laparoscopic video summarization is formulated as a diverse and weighted dictionary selection model, in which image quality is taken into account to select high quality keyframes, and a diversity regularization term is added to reduce redundancy among the selected keyframes. Finally, an iterative algorithm with a rapid convergence rate is designed for model optimization, and the convergence of the proposed method is also analyzed. Experimental results on a recently released laparoscopic dataset demonstrate the clear superiority of the proposed method. The proposed method can facilitate access to key information in surgeries, training of junior clinicians, explanations to patients, and archiving of case files.
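The weighted dictionary selection idea in the abstract can be illustrated with a minimal sketch. This is not the paper's model or algorithm: it assumes a generic row-sparse self-reconstruction objective, 0.5 * ||X - XW||_F^2 + lam * sum_i penalty[i] * ||W[i, :]||_2, solved by plain proximal gradient descent, and it omits the diversity regularization term entirely. The names `weighted_dictionary_selection`, `select_keyframes`, `penalty`, and `lam` are hypothetical; `penalty` stands in for a per-frame cost that would be lower for higher quality frames, and `X` stands in for deep features with one column per frame.

```python
import numpy as np


def weighted_dictionary_selection(X, penalty, lam=1.0, n_iter=200):
    """Score frames by how much they help reconstruct all frames.

    X       : (d, n) array, one feature column per frame (assumed input).
    penalty : (n,) positive per-frame costs; lower means "prefer this frame".
    Returns : (n,) non-negative scores, the row norms of the coefficient matrix W.
    """
    n = X.shape[1]
    G = X.T @ X                                    # Gram matrix of the frames
    step = 1.0 / max(np.linalg.norm(G, 2), 1e-12)  # 1 / Lipschitz constant
    W = np.zeros((n, n))
    for _ in range(n_iter):
        grad = G @ W - G                           # gradient of 0.5 * ||X - XW||_F^2
        V = W - step * grad
        # Row-wise group soft-thresholding with a per-row (weighted) threshold.
        norms = np.linalg.norm(V, axis=1, keepdims=True)
        thresh = step * lam * penalty[:, None]
        scale = np.maximum(0.0, 1.0 - thresh / np.maximum(norms, 1e-12))
        W = scale * V
    return np.linalg.norm(W, axis=1)


def select_keyframes(X, penalty, k, lam=1.0):
    """Return the indices of the k highest-scoring frames."""
    scores = weighted_dictionary_selection(X, penalty, lam=lam)
    return np.argsort(-scores)[:k]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(8, 6))   # 6 synthetic frames with 8-dimensional features
    penalty = np.ones(6)          # uniform cost, i.e. no quality preference
    print(select_keyframes(X, penalty, k=3, lam=0.1))
```

Frames whose rows of W shrink to zero contribute nothing to the reconstruction and are never selected, which is what makes the row-sparsity penalty act as a selection mechanism; raising a frame's entry in `penalty` makes it more likely to be dropped.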
Pages: 1686-1698
Page count: 13