Database for Arabic Printed Text Recognition Research

Cited by: 0
Authors
Jaiem, Faten Kallel [1]
Kanoun, Slim [1]
Khemakhem, Maher [1]
El Abed, Haikal [3]
Kardoun, Jihain [2]
Affiliations
[1] Univ Sfax, ISIMS, MIRACL Lab, Sfax, Tunisia
[2] Univ Sfax, ENIS, Dept Comp Engn, Sfax, Tunisia
[3] Tech Univ Carolo Wilhelmina Braunschweig, Inst Commun Technol, Braunschweig, Germany
Keywords
Arabic printed text; APTID / MF database; Open vocabulary; Ground truth; PATTERN-RECOGNITION;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a real-world database for Arabic printed text recognition, APTID / MF (Arabic Printed Text Image Database / Multi-Font). The database can be used to evaluate systems that recognize Arabic printed text with an open vocabulary. APTID / MF may also be used for research in word segmentation and font identification. It was obtained from 387 pages of Arabic printed documents scanned in grayscale at a resolution of 300 dpi. From these documents, 1,845 text blocks were extracted. In addition, a ground truth file is provided for each text block. APTID / MF also includes an Arabic printed character image dataset made up of 27,402 samples. The database is freely available to interested researchers.
Pages: 251-259
Page count: 9
Related papers
  • [1] Benchmark database and GUI environment for printed arabic text recognition research
    Al-Hashim, Amin G.
    Mahmoud, Sabri A.
    WSEAS Transactions on Information Science and Applications, 2010, 7 (04): : 587 - 597
  • [2] Printed Arabic Text Database for Automatic Recognition Systems
    Bouressace, Hassina
    Csirik, Janos
    PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND TECHNOLOGY APPLICATIONS (ICCTA 2019), 2019, : 107 - 111
  • [3] Printed Arabic Text Database (PATDB) for Research and Benchmarking
    Al-Hashim, Amin G.
    Mahmoud, Sabri A.
    RECENT ADVANCES AND APPLICATIONS OF COMPUTER ENGINEERING: PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE (ACE 10), 2010, : 62 - +
  • [4] PRINTED ARABIC TEXT RECOGNITION
    HASSAN, FH
    ALI, WH
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 1991, 16 (04): : 511 - 518
  • [5] ALTID : Arabic/Latin Text Images Database for recognition research
    Chtourou, Imen
    Rouhou, Ahmed Cheikh
    Jaiem, Faten Kallel
    Kanoun, Slim
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 836 - 840
  • [6] A database for Arabic printed character recognition
    AbdeRaouf, Ashraf
    Higgins, Colin A.
    Khalil, Mahmoud
    IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2008, 5112 : 567 - +
  • [7] MACHINE RECOGNITION AND CORRECTION OF PRINTED ARABIC TEXT
    AMIN, A
    MARI, JF
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1989, 19 (05): : 1300 - 1306
  • [8] Optical character recognition of arabic printed text
    Electrical and Electronics Engineering Department, University of Khartoum, Sudan
    SCOReD - IEEE Stud. Conf. Res. Dev., (235-240):
  • [9] Optical Character Recognition of Arabic Printed Text
    Taha, Safwa
    Babiker, Yusra
    Abbas, Mohamed
    2012 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2012,
  • [10] A Database for Offline Arabic Handwritten Text Recognition
    Mahmoud, Sabri A.
    Ahmad, Irfan
    Alshayeb, Mohammed
    Al-Khatib, Wasfi G.
    IMAGE ANALYSIS AND RECOGNITION: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, PT II: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, 2011, 6754 : 397 - 406