A graph-based solution for writer identification from handwritten text

Cited by: 2
Authors
Rahman, Atta Ur [1]
Halim, Zahid [1]
Affiliations
[1] Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Machine Intelligence Res Grp MInG, Topi, Pakistan
Funding
UK Research and Innovation (UKRI);
Keywords
Writer identification; Preprocessing; Graph-based representation; Feature extraction; Ensemble learning; INDIVIDUALITY; CODEBOOK; FEATURES;
DOI
10.1007/s10115-022-01676-7
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Writer identification is an active research problem because of its applications in forensic and historical document analysis. Identifying a writer from the shapes of her handwritten characters, produced in a practiced writing style, is challenging. Varying shapes, styles, orientations, and character sizes, together with complex structures, inconsistency, and the cursive nature of the text, make the task harder still. Solving this problem requires a structural representation of the handwritten characters that captures their spatial information. To that end, a novel graph-based approach is proposed to spatially map the handwritten text, adapt to its structure and size, and explore the relationships that exist among the characters. First, image processing steps are executed: binarization, baseline correction, separation of the writing region, and thinning of the strokes to a width of one pixel. This work then presents a novel algorithm for detecting key points (KPs) in a handwritten skeleton image and extracting their two-dimensional pixel coordinates. The handwriting samples are transformed into a graph-based representation in which KPs are the nodes and the line segments connecting adjacent KPs are the edges. Features are extracted from these graph representations, and ensemble learning approaches are employed for classification. Four benchmark datasets and one custom-collected dataset are used for the experiments. The proposed solution achieves identification accuracies of 98.26%, 98.84%, 99.67%, 98.51%, and 97.73% on the CERUG-EN, CVL, Firemaker, IAM, and custom datasets, respectively.
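The skeleton-to-graph step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the paper's key point detection algorithm, so this example substitutes a common neighbor-count heuristic: a skeleton pixel is treated as a KP if it has exactly one 8-connected neighbor (stroke endpoint) or three or more (junction). The function names (find_key_points, build_graph, graph_features) and the three features computed are hypothetical and chosen for illustration only; they are not the authors' method or feature set.

```python
import math
import numpy as np

# 8-connected neighbourhood offsets (row, column)
OFFSETS = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1) if (dr, dc) != (0, 0)]

def find_key_points(skeleton):
    """Assumed heuristic: KPs are skeleton pixels with 1 or >= 3 neighbours."""
    skel = np.asarray(skeleton, dtype=bool)
    h, w = skel.shape
    padded = np.pad(skel, 1).astype(int)
    neighbours = np.zeros((h, w), dtype=int)
    for dr, dc in OFFSETS:
        # Shifted view of the padded image adds one neighbour direction
        neighbours += padded[1 + dr:1 + dr + h, 1 + dc:1 + dc + w]
    is_kp = skel & ((neighbours == 1) | (neighbours >= 3))
    return [(int(r), int(c)) for r, c in np.argwhere(is_kp)]

def build_graph(skeleton, key_points):
    """Connect two KPs when a stroke path joins them without crossing a third KP."""
    skel = np.asarray(skeleton, dtype=bool)
    h, w = skel.shape
    kp_set = set(key_points)
    edges = set()
    for start in key_points:
        stack, seen = [start], {start}
        while stack:
            r, c = stack.pop()
            for dr, dc in OFFSETS:
                nxt = (r + dr, c + dc)
                if nxt in seen or not (0 <= nxt[0] < h and 0 <= nxt[1] < w):
                    continue
                if not skel[nxt]:
                    continue
                seen.add(nxt)
                if nxt in kp_set:
                    edges.add(tuple(sorted((start, nxt))))  # undirected edge
                else:
                    stack.append(nxt)  # keep walking along the stroke
    return edges

def graph_features(key_points, edges):
    """Illustrative features only: node count, edge count, mean edge length."""
    lengths = [math.dist(a, b) for a, b in edges]
    return {
        "num_nodes": len(key_points),
        "num_edges": len(edges),
        "mean_edge_length": sum(lengths) / len(lengths) if lengths else 0.0,
    }

# A small Y-shaped skeleton: one junction at (2, 2) and three stroke ends.
demo = np.zeros((5, 5), dtype=bool)
for pixel in [(0, 2), (1, 2), (2, 2), (3, 1), (4, 0), (3, 3), (4, 4)]:
    demo[pixel] = True

demo_kps = find_key_points(demo)
demo_edges = build_graph(demo, demo_kps)
demo_features = graph_features(demo_kps, demo_edges)
```

The neighbor-count rule assumes a cleanly thinned, one-pixel-wide skeleton; at right-angle corners and T-junctions it tends to mark clusters of adjacent junction pixels, which a practical pipeline would need to merge. A feature vector of this kind could then be passed to an ensemble classifier, as the abstract describes.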
Pages: 1501-1523
Number of pages: 23
Source
Knowledge and Information Systems, 2022, 64: 1501-1523