Application of texture-based features for text non-text classification in printed document images with novel feature selection algorithm

被引:0
作者
Soulib Ghosh
S. K. Khalid Hassan
Ali Hussain Khan
Ankur Manna
Showmik Bhowmik
Ram Sarkar
机构
[1] Jadavpur University,Department of Computer Science and Engineering
[2] Ghani Khan Choudhury Institute of Engineering and Technology,Department of Computer Science and Engineering
来源
Soft Computing | 2022年 / 26卷
关键词
Feature selection; Text non-text separation; Particle swarm optimization; Local ternary pattern; Document image;
D O I
暂无
中图分类号
学科分类号
摘要
Text non-text separation is one of the most essential pre-processing steps for any optical character recognition (OCR) system. As an OCR engine can only process texts, the non-texts present in an input document image are required to be suppressed at the initial level. Therefore, to build a complete OCR system, an efficient text non-text separation module needs to be developed. To this end, we have proposed a texture-based feature descriptor followed by a novel feature selection technique for region-based text non-text classification. First, we have incorporated rotation invariant property with local ternary pattern to form a new texture-based feature descriptor, rotation invariant local ternary pattern (RILTP). Next, a novel feature selection technique is proposed which is a modified version of binary particle swarm optimization (BPSO). For the evaluation of the proposed text non-text classification method, we have initially constructed a database consisting of 690 images of text and non-text regions extracted from 70 pages of RDCL 2015 and 75 pages of RDCL 2017 page segmentation competitions databases. In this database, each class contains 345 data samples. The proposed texture-based feature descriptor has obtained an accuracy of 97.09% on this database. Whereas, after applying BPSO, the feature dimension is reduced by approximately 55% and at the same time, the accuracy reaches 97.5%. Furthermore, in this work, another database is also created from Media team document pages to validate the robustness of this method. The second database comprises 100 text and 100 non-text images. The method has achieved 96.28% accuracy when it is trained with the first database and tested with the second database. The comparative study reveals the robustness and strength of the proposed method as it outnumbers many state-of-the-art texture-based features. Besides, the proposed feature selection method is also compared with various standard feature selection methods, and it has been observed that the proposed one outperforms all those methods considered here for comparison.
引用
收藏
页码:891 / 909
页数:18
相关论文
共 114 条
[1]  
Asif MDA(2014)A novel hybrid method for text detection and extraction from news videos Middle East J Sci Res 19 716-722
[2]  
Tariq UU(2006)Multi-objective genetic algorithms: a way to improve the convergence rate Eng Appl Artif Intell 19 501-510
[3]  
Baig MN(1994)Using mutual information for selecting features in supervised neural net learning IEEE Trans Neural Netw 5 531-1559
[4]  
Ahmad W(2014)A survey on filter techniques for feature selection in text mining Adv Intell Syst Comput 236 1545-3570
[5]  
Augusto OB(2021)Understanding contents of filled-in Bangla form images Multimed Tools Appl 80 3529-20
[6]  
Rabeau S(2018)Text and non-text separation in offline document images: a survey Int J Doc Anal Recognit 21 1-249
[7]  
Dépincé P(2007)Feature selection for high-dimensional data—a pearson redundancy based filter Adv Soft Comput 45 242-224
[8]  
Bennis F(2005)Estimating optimal feature subsets using efficient estimation of high-dimensional mutual information IEEE Trans Neural Networks 16 213-19419
[9]  
Battiti R(2017)Facial expression recognition from image sequence based on LBP and taylor expansion IEEE Access 5 19409-523
[10]  
Bharti KK(1999)Distributed genetic algorithm for deterministic and stochastic labor scheduling problems Eur J Oper Res 118 505-1555