A Feature Vector for Optical Character Recognition

被引:0
|
作者
Zarei, Ariyan [1 ]
Shooshtari, Arman Yousefzadeh [1 ]
机构
[1] Shahid Beheshti Univ, Dept Comp Sci, Tehran, Iran
来源
PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEM (ICISS 2018) | 2018年
关键词
Optical character recognition; pattern recognition; classification; feature vector;
D O I
10.1145/3209914.3209942
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The extraction of the written text in an image has always been an important application of computer vision since it was introduced. It is widely used in automatic number plate recognition, handwriting recognition, extracting data from scanned documents such as passports, ID cards, banking forms, etc. There exist a wide variety of approaches to the general problem of optical character recognition such as Template Matching, Structural Classification, Artificial Neural Networks, etc. In this paper we introduced a new feature vector for optical character recognition and we tested its accuracy by using a Nearest Neighbor classifier. The new feature vector is a sequence generated by putting together the orientations of each pixel to a base point. The classifier then, is simply Longest Common Subsequence algorithm. In other words, a new image contains a character if and only if the corresponding sequence of the image has the longest common subsequence with the feature vector or sequence of that character among all the characters available. The experiments provided us with satisfying results which can be definitely better under better classifiers such as RNN or SVM.
引用
收藏
页码:133 / 136
页数:4
相关论文
共 50 条
  • [1] Optical character recognition with feature extraction and associative memory matrix
    Sasaki, O
    Shibahara, A
    Suzuki, T
    OPTICAL ENGINEERING, 1998, 37 (06) : 1827 - 1833
  • [2] Evaluation of Optical Character Recognition Algorithms and Feature Extraction Techniques
    Tanvir, Syed Hassan
    Khan, Tamim Ahmed
    Yamin, Abu Bakar
    2016 SIXTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2016, : 326 - 331
  • [3] Printed Arabic Optical Character Recognition using Support vector machine
    Yamina, Ouled Jaafri
    El Mamoun, Mamouni
    Kaddour, Sadouni
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MATHEMATICS AND INFORMATION TECHNOLOGY (ICMIT), 2017, : 134 - 140
  • [4] Optical Character Recognition Based on Least Square Support Vector Machine
    Xie, Jianhong
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 1, PROCEEDINGS, 2009, : 626 - 629
  • [5] Optical Character Recognition: An Overview and an Insight
    Berchmans, Deepa
    Kumar, S. S.
    2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 1361 - 1365
  • [6] An analog VLSI implementation of a feature extractor for real time optical character recognition
    Bo, GM
    Caviglia, D
    Valle, M
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1998, 33 (04) : 556 - 564
  • [7] FEATURE-BASED NEURAL WAVELET OPTICAL CHARACTER-RECOGNITION SYSTEM
    IFTEKHARUDDIN, KM
    SCHECHINGER, TD
    JEMILI, K
    KARIM, MA
    OPTICAL ENGINEERING, 1995, 34 (11) : 3193 - 3199
  • [8] COMPARISON BETWEEN NEURAL NETWORK AND SUPPORT VECTOR MACHINE IN OPTICAL CHARACTER RECOGNITION
    Phangtriastua, Michael Reynaldo
    Harefa, Jeklin
    Tanoto, Dian Felita
    DISCOVERY AND INNOVATION OF COMPUTER SCIENCE TECHNOLOGY IN ARTIFICIAL INTELLIGENCE ERA, 2017, 116 : 351 - 357
  • [9] Optical character recognition system for Baybayin scripts using support vector machine
    Pino, Rodney
    Mendoza, Renier
    Sambayan, Rachelle
    PEERJ COMPUTER SCIENCE, 2021, 7 : 1 - 24
  • [10] Feature Vector-Based Artificial Neural Network Classification Model for Handwritten Character Recognition
    Mohamada, Muhammad Arif
    Haron, Habibollah
    Hasan, Haswadi
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 409 - 422