Representation Learning and Dissimilarity for Writer Identification

被引:0
作者
Helal, Lucas G. [1 ]
Bertolini, Diego [2 ]
Costa, Yandre M. G. [1 ]
Cavalcanti, George D. C. [3 ]
Britto, Alceu S., Jr. [4 ]
Oliveira, Luiz E. S. [5 ]
机构
[1] State Univ Maringa UEM, Av Colombo 5790,Bloco C56, Maringa, Parana, Brazil
[2] Fed Technol Univ Parana UTFPR, Via Rosalina Maria Santos 1233, Campo Mourao, Parana, Brazil
[3] Fed Univ Pernambuco UFPE, Av Jornalista Anibal Fernandes S-N, Recife, PE, Brazil
[4] Pontifical Catholic Univ Parana PUCPR, Rua Imaculada Conceicao 1155, Curitiba, Parana, Brazil
[5] Fed Univ Parana UFPR, Rua Cel Francisco Heraclito Santos 100, Curitiba, Parana, Brazil
来源
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP 2019) | 2019年
关键词
Writer identification; feature learning; dissimilarity;
D O I
10.1109/iwssip.2019.8787293
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Writer identification by using manuscripts became a very important research topic in forensic analysis of documents. That is because the writing can be considered as an identifying characteristic of a person. By analyzing the challenges and the resources available for the research development in this field of investigation, one can find different databases of writers manuscripts. In this work, we aim to evaluate the performance of deep learning techniques in the process of writer identification by using manuscripts. For this, a convolutional neural network (CNN) was developed to address the writer identification task. We have also evaluated the performance of features obtained with CNN when submitted to a support vector machine (SVM) classifier, considering the traditional classification approach. To assert the objectives of this proposal, CVL and BFL databases were used. The experiments were conducted by using a texture generation approach, starting from the original documents. Feature learning was accomplished using CNN on the texture obtained from the manuscripts. Finally, we have also evaluated the impact of the classification using both classification scenarios with or without using the dissimilarity approach. The best results were achieved using the SVM classifier and dissimilarity approach on the features obtained with CNN. The performance demonstrated the robustness of the proposed method with performance similar or superior to that described in the literature.
引用
收藏
页码:63 / 68
页数:6
相关论文
共 28 条
[1]   QUWI: An Arabic and English Handwriting Dataset for Offline Writer Identification [J].
Al Maadeed, Somaya ;
Ayouby, Wael ;
Hassaine, Abdelaali ;
Aljaam, Jihad Mohamad .
13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, :746-751
[2]  
[Anonymous], THESIS
[3]  
[Anonymous], 2018, SEMISUPERVISED FEATU
[4]  
Baranoski F. L., 2007, 33 LAT AM C INF SAN, V1, P25
[5]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[6]   Texture-based descriptors for writer identification and verification [J].
Bertolini, D. ;
Oliveira, L. S. ;
Justino, E. ;
Sabourin, R. .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (06) :2069-2080
[7]  
Cha SH, 2000, LECT NOTES COMPUT SC, V1876, P123
[8]  
Freitas C., 2008, 11 INT WORKSH FRONT
[9]  
Grosicki E., 2008, ICFHR 2008 11 INT C, P1
[10]   Writer verification using texture-based features [J].
Hanusiak, R. K. ;
Oliveira, L. S. ;
Justino, E. ;
Sabourin, R. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (03) :213-226