A graph-based solution for writer identification from handwritten text

Cited by: 2
Authors
Rahman, Atta Ur [1]
Halim, Zahid [1]
Affiliations
[1] Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Machine Intelligence Res Grp MInG, Topi, Pakistan
Funding
UK Research and Innovation (UKRI);
Keywords
Writer identification; Preprocessing; Graph-based representation; Feature extraction; Ensemble learning; INDIVIDUALITY; CODEBOOK; FEATURES;
DOI
10.1007/s10115-022-01676-7
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Writer identification is an active research problem due to its applications in forensic and historical document analysis. It is challenging to identify a writer from the shapes of the handwritten characters produced by a practiced writing style. Varying character shapes, styles, orientations, and sizes, together with complex structures, inconsistency, and the cursive nature of the text, make the task harder. To solve this problem, we need to explore the structural representation and spatial information of the handwritten characters. To this end, a novel graph-based approach is proposed to spatially map the handwritten text, adapt to its structure and size, and explore the relationships that exist between its characters. First, image processing steps such as binarization, baseline correction, separation of the writing region, and thinning of the strokes to a width of a single pixel are applied. This work presents a novel algorithm for detecting key points (KPs) in a handwriting skeleton image and extracting their two-dimensional pixel coordinates. The handwriting samples are then transformed into a graph-based representation, with KPs as nodes and the line segments connecting adjacent KPs as edges. Features are extracted from the graph-based representations of the handwritten text. For classification, ensemble learning approaches are employed. Four benchmark datasets and one custom-collected dataset are used for experimentation. The proposed solution achieves identification accuracies of 98.26%, 98.84%, 99.67%, 98.51%, and 97.73% on the CERUG-EN, CVL, Firemaker, IAM, and custom datasets, respectively.
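The skeleton-to-graph step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the authors' key point detection algorithm, so the code below substitutes a common neighbour-count heuristic (a skeleton pixel with one 8-connected neighbour is treated as an endpoint, one with three or more as a junction). All function names, the `#` text encoding of the skeleton, and the three summary features are hypothetical choices made for illustration only. Binarization, baseline correction, thinning, and the ensemble classifier are outside the scope of this sketch.

```python
from itertools import product
from math import dist

# 8-connected neighbourhood offsets as (row, col) pairs.
OFFSETS = [(dr, dc) for dr, dc in product((-1, 0, 1), repeat=2) if (dr, dc) != (0, 0)]


def skeleton_pixels(rows):
    """Turn a list of strings ('#' marks a stroke pixel) into a set of (row, col)."""
    return {(r, c) for r, line in enumerate(rows) for c, ch in enumerate(line) if ch == "#"}


def neighbours(pixel, pixels):
    """Return the 8-connected neighbours of `pixel` that belong to the skeleton."""
    r, c = pixel
    return [(r + dr, c + dc) for dr, dc in OFFSETS if (r + dr, c + dc) in pixels]


def find_key_points(pixels):
    """Assumed heuristic, not the paper's algorithm: endpoints and junctions."""
    key_points = set()
    for p in pixels:
        degree = len(neighbours(p, pixels))
        if degree == 1 or degree >= 3:
            key_points.add(p)
    return key_points


def build_graph(pixels):
    """Trace the skeleton from each key point to the next one and record an edge."""
    key_points = find_key_points(pixels)
    edges = set()
    for start in key_points:
        for step in neighbours(start, pixels):
            prev, cur = start, step
            # Every non-key pixel reached here has exactly two neighbours,
            # so there is exactly one way forward.
            while cur not in key_points:
                forward = [n for n in neighbours(cur, pixels) if n != prev]
                prev, cur = cur, forward[0]
            # Store edges undirected; parallel strokes between the same
            # pair of key points collapse into one edge in this sketch.
            edges.add(tuple(sorted((start, cur))))
    return key_points, edges


def graph_features(key_points, edges):
    """Illustrative features only: node count, edge count, mean edge length."""
    lengths = [dist(a, b) for a, b in edges]
    mean_length = sum(lengths) / len(lengths) if lengths else 0.0
    return len(key_points), len(edges), mean_length


if __name__ == "__main__":
    # A tiny X-shaped skeleton: four endpoints and one junction in the centre.
    sample = ["#...#", ".#.#.", "..#..", ".#.#.", "#...#"]
    nodes, links = build_graph(skeleton_pixels(sample))
    print(sorted(nodes))
    print(sorted(links))
    print(graph_features(nodes, links))
```

On real thinned handwriting, the neighbour-count rule tends to mark several adjacent pixels around a stroke crossing as junctions, so a practical pipeline would likely merge such clusters before building the graph. Feature vectors produced this way could then be passed to any ensemble classifier.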
Pages: 1501-1523
Number of pages: 23