A New Approach to Extract Text from Images based on DWT and K-means Clustering

被引:3
作者
Ghai, Deepika [1 ]
Gera, Divya [1 ]
Jain, Neelu [1 ]
机构
[1] PEC Univ Technol, ECE Dept, Sect 12, Chandigarh 160012, UT, India
关键词
Text extraction; Texture features; DWT; K-means clustering; sliding window; voting decision; VIDEO; LOCALIZATION;
D O I
10.1080/18756891.2016.1237189
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text present in image provides important information for automatic annotation, indexing and retrieval. Therefore, its extraction is a well known research area in computer vision. However, variations of text due to differences in orientation, alignment, font, size, low image contrast and complex background make the problem of text extraction extremely challenging. In this paper, we propose a texture-based text extraction method using DWT with K-means clustering. First, the edges are detected from image by using DWT. Then, a small size overlapped sliding window is used to scan high frequency component sub-bands from which texture features of text and non-text regions are extracted. Based on these features, K-means clustering is employed to classify the image into text, simple background and complex background clusters. Finally, voting decision process and area based filtering are used to locate text regions exactly. Experimentation is carried out using public dataset ICDAR 2013 and our own dataset for English, Hindi and Punjabi text images for different number of clusters. The results show that the proposed method gives promising results with different languages in terms of detection rate (DR), precision rate (PR) and recall rate (RR).
引用
收藏
页码:900 / 916
页数:17
相关论文
共 34 条
  • [1] Angadi S. A., 2010, INT J IMAGE PROCESSI, V3, P229
  • [2] Anoual H., 2010, P 5 INT S 1 5 COMM M, P1, DOI DOI 10.1109/ISVC.2010.5656284
  • [3] A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video
    Antani, S
    Kasturi, R
    Jain, R
    [J]. PATTERN RECOGNITION, 2002, 35 (04) : 945 - 965
  • [4] A Robust Multilingual Text Detection Approach Based on Transforms and Wavelet Entropy
    Aradhya, V. N. Manjunath
    Pavithra, M. S.
    Naveena, C.
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT-2012), 2012, 4 : 232 - 237
  • [5] Azadboni MK, 2012, 2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), P794, DOI 10.1109/ISTEL.2012.6483094
  • [6] Grover S, 2009, ANNU IEEE IND CONF, P582
  • [7] Accurate text localization in images based on SVM output scores
    Jung, Cheolkon
    Liu, Qifeng
    Kim, Joongkyu
    [J]. IMAGE AND VISION COMPUTING, 2009, 27 (09) : 1295 - 1301
  • [8] Text information extraction in images and video: a survey
    Jung, K
    Kim, KI
    Jain, AK
    [J]. PATTERN RECOGNITION, 2004, 37 (05) : 977 - 997
  • [9] A New Approach for Overlay Text Detection and Extraction From Complex Video Scene
    Kim, Wonjun
    Kim, Changick
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (02) : 401 - 411
  • [10] Kumar M., 2010, Proceedings of the 2010 IEEE 10th International Conference on Computer and Information Technology (CIT 2010), P1413, DOI 10.1109/CIT.2010.253