A deep learning based system for writer identification in handwritten Arabic historical manuscripts

被引:0
|
作者
Michel Chammas
Abdallah Makhoul
Jacques Demerjian
Elie Dannaoui
机构
[1] University of Balamand,Digital Humanities Center
[2] Université de Bourgogne Franche-Comté,Femto
[3] Lebanese University,ST Institute, UMR CNRS 6174
来源
Multimedia Tools and Applications | 2022年 / 81卷
关键词
Writer identification; Historical documents; Artificial intelligence; Document analysis; Arabic manuscripts;
D O I
暂无
中图分类号
学科分类号
摘要
Determining the writer or transcriber of historical Arabic manuscripts has always been a major challenge for researchers in the field of humanities. With the development of advanced techniques in pattern recognition and machine learning, these technologies have been applied to automate the extraction of paleographical features in order to solve this issue. This paper presents a baseline system for writer identification, tested on a Historical Arabic dataset of 11610 single and double folio images. These texts were extracted from a unique collection of 567 Historical Arabic Manuscripts available at the Balamand Digital Humanities Center. A survey has been conducted on the available Arabic datasets and previously proposed techniques and algorithms. The Balamand dataset presents an important challenge due to the geo-historical identity of manuscripts and their physical conditions. An advanced Deep Learning system was developed and tested on three different Latin and Arabic datasets: ICDAR19, ICFHR20 and KHATT, before testing it on the Balamand dataset. The system was compared with many other systems and it has yielded a state-of-the-art performance on the new challenging images with 95.2% mean Average Precision (mAP) and 98.1% accuracy.
引用
收藏
页码:30769 / 30784
页数:15
相关论文
共 50 条
  • [31] Forged document detection and writer identification through unsupervised deep learning approach
    Prachi Tyagi
    Khushboo Agarwal
    Garima Jaiswal
    Arun Sharma
    Ritu Rani
    Multimedia Tools and Applications, 2024, 83 : 18459 - 18478
  • [32] Writer Identification using Deep Learning with FAST Keypoints and Harris corner detector
    Semma, Abdelillah
    Hannad, Yaacoub
    Siddiqi, Imran
    Djeddi, Chawki
    El Youssfi El Kettani, Mohamed
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184
  • [33] A Novel Approach for off-Line Arabic Writer Identification Based on Stroke Feature Combination
    Abdi, Mohamed Nidhal
    Khemakhem, Maher
    Ben-Abdallah, Hanene
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 595 - 598
  • [34] An Experimental Comparison between Deep Learning and Classical Machine Learning Approaches for Writer Identification in Medieval Documents
    Cilia, Nicole Dalia
    De Stefano, Claudio
    Fontanella, Francesco
    Marrocco, Claudio
    Molinara, Mario
    Freca, Alessandra Scotto di
    JOURNAL OF IMAGING, 2020, 6 (09)
  • [35] A-VLAD: An End-to-End Attention-Based Neural Network for Writer Identification in Historical Documents
    Ngo, Trung Tan
    Nguyen, Hung Tuan
    Nakagawa, Masaki
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 396 - 409
  • [36] The System of Writer Identification Based on ASP.net and MATLAB
    Han, Xiaojun
    Wang, Xiaoming
    PROCEEDINGS OF 2010 ASIA-PACIFIC YOUTH CONFERENCE ON COMMUNICATION, VOLS 1 AND 2, 2010, : 594 - +
  • [37] A writer identification and verification system using HMM based recognizers
    Schlapbach, Andreas
    Bunke, Horst
    PATTERN ANALYSIS AND APPLICATIONS, 2007, 10 (01) : 33 - 43
  • [38] A writer identification and verification system using HMM based recognizers
    Andreas Schlapbach
    Horst Bunke
    Pattern Analysis and Applications, 2007, 10 : 33 - 43
  • [39] Writer identification system for pre-segmented offline handwritten Devanagari characters using k-NN and SVM
    Dargan, Shaveta
    Kumar, Munish
    Garg, Anupam
    Thakur, Kutub
    SOFT COMPUTING, 2020, 24 (13) : 10111 - 10122
  • [40] Writer identification system for pre-segmented offline handwritten Devanagari characters using k-NN and SVM
    Shaveta Dargan
    Munish Kumar
    Anupam Garg
    Kutub Thakur
    Soft Computing, 2020, 24 : 10111 - 10122