Endoscopic Image Classification and Retrieval using Clustered Convolutional Features

被引:32
作者
Ahmad, Jamil [1 ]
Muhammad, Khan [1 ]
Lee, Mi Young [1 ]
Baik, Sung Wook [1 ]
机构
[1] Sejong Univ, Digital Contents Res Inst, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Image retrieval; Features extraction; Convolution; Classification; Spatial pooling; Endoscopy; COLOR; TEXTURE;
D O I
10.1007/s10916-017-0836-y
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
With the growing use of minimally invasive surgical procedures, endoscopic video archives are growing at a rapid pace. Efficient access to relevant content in such huge multimedia archives require compact and discriminative visual features for indexing and matching. In this paper, we present an effective method to represent images using salient convolutional features. Convolutional kernels from the first layer of a pre-trained convolutional neural network (CNN) are analyzed and clustered into multiple distinct groups, based on their sensitivity to colors and textures. Dominant features detected by each cluster are collected into a single, layout-preserving feature map using a spatial maximal activator pooling (SMAP) approach. A moving window based structured pooling method then captures spatial layout features and global shape information from the aggregated feature map to populate feature histograms. Finally, individual histograms for each cluster are combined into a single comprehensive feature histogram. Clustering convolutional feature space allow extraction of color and texture features of varying strengths. Further, the SMAP approach enable us to select dominant discriminative features. The proposed features are compact and capable of conveniently outperforming several existing features extraction approaches in retrieval and classification tasks on endoscopy images dataset.
引用
收藏
页数:12
相关论文
共 36 条
[1]   Embedded deep vision in smart cameras for multi-view objects representation and retrieval [J].
Ahmad, Jamil ;
Mehmood, Irfan ;
Rho, Seungmin ;
Chilamkurti, Naveen ;
Baik, Sung Wook .
COMPUTERS & ELECTRICAL ENGINEERING, 2017, 61 :297-311
[2]   SiNC: Saliency-injected neural codes for representation and efficient retrieval of medical radiographs [J].
Ahmad, Jamil ;
Sajjad, Muhammad ;
Mehmood, Irfan ;
Baik, Sung Wook .
PLOS ONE, 2017, 12 (08)
[3]   Efficient object-based surveillance image search using spatial pooling of convolutional features [J].
Ahmad, Jamil ;
Mehmood, Irfan ;
Baik, Sung Wook .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 45 :62-76
[4]   Multi-scale local structure patterns histogram for describing visual contents in social image retrieval systems [J].
Ahmad, Jamil ;
Sajjad, Muhammad ;
Rho, Seungmin ;
Baik, Sung Wook .
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (20) :12669-12692
[5]  
[Anonymous], J REAL TIME IMAGE PR
[6]  
[Anonymous], 2011, P EUR S ART NEUR NET
[7]  
[Anonymous], ADV NEURAL INF PROCE
[8]  
[Anonymous], 2008, COMPUT VIS IMAGE UND, DOI DOI 10.1016/j.cviu.2007.09.014
[9]   Neural Codes for Image Retrieval [J].
Babenko, Artem ;
Slesarev, Anton ;
Chigorin, Alexandr ;
Lempitsky, Victor .
COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :584-599
[10]  
Haas S., 2012, Medical Content-Based Retrieval for Clinical Decision Support, P58