A graph-based solution for writer identification from handwritten text

Cited by: 2
Authors
Rahman, Atta Ur [1]
Halim, Zahid [1]
Affiliations
[1] Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Machine Intelligence Res Grp MInG, Topi, Pakistan
Funding
UK Research and Innovation (UKRI)
Keywords
Writer identification; Preprocessing; Graph-based representation; Feature extraction; Ensemble learning; INDIVIDUALITY; CODEBOOK; FEATURES
DOI
10.1007/s10115-022-01676-7
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Writer identification is an active research problem due to its applications in forensic and historical document analysis. It is challenging to identify a writer from the shapes of handwritten characters produced in a practiced writing style. Differing shapes, styles, orientations, and character sizes, together with complex structures, inconsistency, and the cursive nature of the text, make the task harder. To solve this problem, we need to explore a structural representation and the spatial information of the handwritten characters. For this, a novel graph-based approach is proposed here to spatially map the handwritten text, capture its structure and size, and explore the relationships that exist among the characters. First, image processing steps such as binarization, baseline correction, separation of the writing region, and thinning of the strokes to a width of a single pixel are executed. This work presents a novel algorithm for detecting key points (KPs) in a handwritten skeleton image and extracting their two-dimensional pixel coordinates. The handwriting samples are then transformed into a graph-based representation, with KPs as the nodes and the line segments connecting adjacent KPs as the edges. Features are extracted from the graph-based representations of the handwritten text. For classification, ensemble learning approaches are employed. Four benchmark datasets and one custom-collected dataset are used for the experiments. The proposed solution achieves identification accuracies of 98.26%, 98.84%, 99.67%, 98.51%, and 97.73% on the CERUG-EN, CVL, Firemaker, IAM, and custom datasets, respectively.
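The abstract does not spell out how key points are detected, so the following is only a minimal sketch of one common way to find candidate key points in a one-pixel-wide skeleton. It assumes the classical crossing-number test (counting 0-to-1 transitions around a pixel's 8-neighbour ring), which is a stand-in and not the paper's novel algorithm. The names `crossing_number`, `detect_key_points`, and the `PLUS` sample image are hypothetical and chosen for illustration.

```python
from typing import List, Tuple

# 8-neighbour offsets (d_row, d_col) in clockwise order, starting from north.
RING = [(-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1), (-1, -1)]


def crossing_number(img: List[List[int]], r: int, c: int) -> int:
    """Count 0->1 transitions while walking once around the 8-neighbour ring."""
    h, w = len(img), len(img[0])

    def px(rr: int, cc: int) -> int:
        # Pixels outside the image are treated as background.
        return img[rr][cc] if 0 <= rr < h and 0 <= cc < w else 0

    ring = [px(r + dr, c + dc) for dr, dc in RING]
    return sum(1 for i in range(8) if ring[i] == 0 and ring[(i + 1) % 8] == 1)


def detect_key_points(img: List[List[int]]) -> List[Tuple[int, int, str]]:
    """Return (row, col, kind) for endpoint and junction pixels of a skeleton.

    Assumption: crossing number 1 marks a stroke endpoint and 3 or more marks
    a junction; pixels with crossing number 2 lie along a stroke and are skipped.
    """
    key_points = []
    for r, row in enumerate(img):
        for c, value in enumerate(row):
            if not value:
                continue
            cn = crossing_number(img, r, c)
            if cn == 1:
                key_points.append((r, c, "endpoint"))
            elif cn >= 3:
                key_points.append((r, c, "junction"))
    return key_points


# Hypothetical sample: a plus-shaped skeleton (1 = stroke pixel, 0 = background).
PLUS = [
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
]

key_points = detect_key_points(PLUS)
```

On the sample image this yields four endpoints at the arm tips and one junction at the centre. The returned coordinates could serve as graph nodes; linking adjacent key points into edges would additionally require tracing the skeleton between them, which this sketch leaves out.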
Pages: 1501-1523
Number of pages: 23