Content-Based Image Indexing by Data Clustering and Inverse Document Frequency

被引:0
作者
Grycuk, Rafal [1 ]
Gabryel, Marcin [1 ]
Korytkowski, Marcin [1 ]
Scherer, Rafal [1 ]
机构
[1] Czestochowa Tech Univ, Inst Computat Intelligence, PL-42200 Czestochowa, Poland
来源
BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2014 | 2014年 / 424卷
关键词
CBIR; content based image retrieval; image database; key-points; clustering; inverse document; MEAN SHIFT;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present an algorithm for creating and searching large image databases. Effective browsing and searching such collections of images based on their content is one of the most important challenges of computer science. In the presented algorithm, the process of inserting data to the database consists of several stages. In the first step interest points are generated from images by e. g. SIFT, SURF or PCA SIFT algorithms. The resulting huge number of key points is then reduced by data clustering, in our case by a novel, parameterless version of the mean shift algorithm. The reduction is achieved by subsequent operation on generated cluster centers. This algorithm has been adapted specifically for the presented method. Cluster centers are treated as terms and images as documents in the term frequency-inverse document frequency (TF-IDF) algorithm. TF-IDF algorithm allows to create an indexed image database and to fast retrieve desired images. The proposed approach is validated by numerical experiments on images with different content.
引用
收藏
页码:374 / 383
页数:10
相关论文
共 22 条