Multi-class Enhanced Image Mining of Heterogeneous Textual Images Using Multiple Image Features

被引:5
作者
Chitrakala, S. [1 ]
Shamini, P. [1 ]
Manjula, D. [2 ]
机构
[1] Anna Univ, Easwari Engn Coll, Madras, Tamil Nadu, India
[2] Anna Univ, Dept Comp Sci & Engn, Madras, Tamil Nadu, India
来源
2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3 | 2009年
关键词
caption text; scene text; decision tree; image classification; GLCM features; image features;
D O I
10.1109/IADCC.2009.4809061
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Image mining deals with the extraction of implicit knowledge, image data relationship, or other patterns not explicitly stored in the images. This paper proposes an enhanced image classifier to extract patterns from images containing text using a combination of features. Image containing text can be divided into the following types : scene text image, caption text image and document image. A total of eight features including intensity histogram features and GLCM texture features are used to classify the images. In the first level of classification, the histogram features are extracted from grayscale images to separate document image from the others. In the second stage, the GLCM features are extracted from binary images to classify scene text and caption text images. In both stages, the decision tree classifier (DTC) is used for the classification. Experimental results have been obtained for a dataset of about 60 images of different types. This technique of classification has not been attempted before and its applications include preprocessing for indexing of images, for simplifying and speeding up Content Based Image Retrieval (CBIR) techniques and in areas of Machine Vision.
引用
收藏
页码:496 / +
页数:2
相关论文
共 9 条
[1]  
Arzhaeva Yulia, 2006, IMAGE CLASSIFICATION
[2]  
Breiman L., 1984, Classification and Regression Trees, V432, P151
[3]  
DUONG TT, 2008, UNSUPERVISED LEARNIN
[4]  
Hanif Shehzad Muhammad, C WORKSH ASS TECHN P
[5]  
Holmes G., 1994, Proceedings of the 1994 Second Australian and New Zealand Conference on Intelligent Information Systems (Cat. No.94TH8019), P357, DOI 10.1109/ANZIIS.1994.396988
[6]  
Le Saux Bertrand, 2003, IMAGE CLASSIFIERS SC
[7]  
LIN WH, 2002, INT WORKSH KNOWL DIS
[8]  
Rafkind Barry., 2006, Proceedings of the Workshop on Linking Natural Language Processing and Biology: Towards Deeper Biological Literature Analysis. Association for Computational Linguistics, P73
[9]  
Safavian S.R., 1991, IEEE T SYSTEMS MAN C, V21