A Feature Vector for Optical Character Recognition

被引:0
|
作者
Zarei, Ariyan [1 ]
Shooshtari, Arman Yousefzadeh [1 ]
机构
[1] Shahid Beheshti Univ, Dept Comp Sci, Tehran, Iran
来源
PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEM (ICISS 2018) | 2018年
关键词
Optical character recognition; pattern recognition; classification; feature vector;
D O I
10.1145/3209914.3209942
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The extraction of the written text in an image has always been an important application of computer vision since it was introduced. It is widely used in automatic number plate recognition, handwriting recognition, extracting data from scanned documents such as passports, ID cards, banking forms, etc. There exist a wide variety of approaches to the general problem of optical character recognition such as Template Matching, Structural Classification, Artificial Neural Networks, etc. In this paper we introduced a new feature vector for optical character recognition and we tested its accuracy by using a Nearest Neighbor classifier. The new feature vector is a sequence generated by putting together the orientations of each pixel to a base point. The classifier then, is simply Longest Common Subsequence algorithm. In other words, a new image contains a character if and only if the corresponding sequence of the image has the longest common subsequence with the feature vector or sequence of that character among all the characters available. The experiments provided us with satisfying results which can be definitely better under better classifiers such as RNN or SVM.
引用
收藏
页码:133 / 136
页数:4
相关论文
共 50 条
  • [41] A Survey on various Optical Character Recognition Techniques
    Sabu, Abin M.
    Das, Anto Sahaya
    2018 CONFERENCE ON EMERGING DEVICES AND SMART SYSTEMS (ICEDSS), 2018, : 152 - 155
  • [42] Authorship Attribution and Optical Character Recognition Errors
    Juola, Patrick
    Noecker, John I., Jr.
    Ryan, Michael V.
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2012, 53 (03): : 101 - 127
  • [43] Optical character recognition for degraded text documents
    Sanyal, Sudip
    Dhingra, Kapil Dev
    Sharma, Pramod Kumar
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 1988 - +
  • [44] Applying SIMD to optical character recognition (OCR)
    Yu, Guan
    Gauthier, Lafruit
    Stahl, Richard
    Corporaal, Henk
    Schelkens, Peter
    OPTICAL AND DIGITAL IMAGE PROCESSING, 2008, 7000
  • [45] Optical character recognition: An illustrated guide to the frontier
    Nagy, G
    Nartker, TA
    Rice, SV
    DOCUMENT RECOGNITION AND RETRIEVAL VII, 2000, 3967 : 58 - 69
  • [46] A Hand-written Character Recognition Based on Feature Coding
    Du, Yanrui
    Chen, Lihui
    Xie, Bin
    MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 594 - +
  • [47] A Hybrid Approach for Feature Extraction in Malayalam Handwritten Character Recognition
    Sujala, K.
    James, Ajay
    Saravanan, C.
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,
  • [48] Sensing Analysis of Feature Extraction Types for Handwritten Character Recognition
    Gao, Jin-feng
    Yao, Ru-xian
    Zhang, Yu
    Lai, Han
    Zhang, Jun-ming
    Chang, Ting-Cheng
    SENSORS AND MATERIALS, 2020, 32 (05) : 1711 - 1729
  • [49] New Feature Vector for Recognition of Short Microbial Genes
    Goli, Baharak
    Aswathi, B. L.
    Joy, Chinu
    Nair, Achuthsankar S.
    ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS, 2012, 305 : 222 - +
  • [50] Handwritten Optical Character Recognition (OCR): A Comprehensive Systematic Literature Review (SLR)
    Memon, Jamshed
    Sami, Maira
    Khan, Rizwan Ahmed
    Uddin, Mueen
    IEEE ACCESS, 2020, 8 : 142642 - 142668