A DEEP LEARNING BASED CHARACTER RECOGNITION SYSTEM FROM MULTIMEDIA DOCUMENT

被引:0
作者
Yadav, Usha [1 ]
Verma, Satya [1 ]
Xaxa, Deepak Kumar [1 ]
Mahobiya, Chandrakant [1 ]
机构
[1] MATS Univ, CSE, Raipur, Madhya Pradesh, India
来源
2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT) | 2017年
关键词
Character recognition; diagonal based feature extraction; Convolution neural network; image processing;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Text recognition from natural scene images is very tough task now these days compare than videos. Application of image processing called pattern recognition make easy to recognize text from multimedia documents. A pattern can be fingerprint image, handwritten word sample, human face images, speech signal and DNA sequence etc or we can say that all pattern are in machine editable form. Text can be recognized with and without segmentation of character. Segmentation can be line, word or character level and without segmentation character is recognized from whole text image. Character recognition is a field of research and various research has been done in the area of pattern recognition. There we use a new technique called diagonal based feature extraction in last layer of convolutional neural network and make feature extraction easy with the help of genetic algorithm. After extraction of feature we provide training to extreme learning machine. Along this feature extraction technique we use feed forward network as a classifier and convolution neural network for feature extractor. It is a deep learning based technique of neural network which use for classification or recognition of text. This is basically used for providing training and in testing phase. CRConvNet has more layers working of all layer shown in flowchart. One dataset which contain 360 training set data that are all in capital(A-Z) and small(a-z) alphabet, digit(0-9) and some special character are also used. Another dataset contains samples of video and images (ICDAR 2003) for testing. Extensive studies shows that the recognition system which using diagonal based feature learning provide high recognition accuracy while requiring less time for training.
引用
收藏
页数:7
相关论文
共 30 条
[1]  
Anam Sarawat, 2009, INT MULT ENG COMP SC
[2]  
[Anonymous], IDSIA0412
[3]   A survey of methods and strategies in character segmentation [J].
Casey, RG ;
Lecolinet, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (07) :690-706
[4]  
Coates Adam, INT C DOC AN REC 201
[5]  
Dinesh Acharya U., 2009, INT J INFORM TECHNOL, V2, P305
[6]  
Duin RPW, 2002, INT C PATT RECOG, P765, DOI 10.1109/ICPR.2002.1048415
[7]   Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation [J].
Elagouni, Khaoula ;
Garcia, Christophe ;
Mamalet, Franck ;
Sebillot, Pascale .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2014, 17 (01) :19-31
[8]  
Goel Vibhor, 2013, INT C DOC AN REC AUG
[9]   Feature selection using probabilistic neural networks [J].
Hunter, A .
NEURAL COMPUTING & APPLICATIONS, 2000, 9 (02) :124-132
[10]  
Jindal Anita, 2012, INT J ADV RES COMPUT, V2