Hidden tree Markov models for document image classification

被引:0
作者
Diligenti, M
Frasconi, P
Gori, M
机构
[1] Univ Siena, Dipartimento Ingn Informazione, I-53100 Siena, Italy
[2] Univ Florence, Dipartimento Sistemi & Informat, I-50139 Florence, Italy
关键词
document classification; machine learning; Markovian models; structured information;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification is an important problem in image document processing and is often a preliminary step toward recognition, understanding, and information extraction. In this paper, the problem is formulated in the framework of concept learning and each category corresponds to the set of image documents with similar physical structure. We propose a solution based on two algorithmic ideas. First, we obtain a structured representation of images based on labeled XY-trees (this representation informs the learner about important relationships between image subconstituents). Second, we propose a probabilistic architecture that extends hidden Markov models for learning probability distributions defined on spaces of labeled trees. Finally, a successful application of this method to the categorization of commercial invoices is presented.
引用
收藏
页码:519 / 523
页数:5
相关论文
共 21 条
[1]  
[Anonymous], 1993, P 13 INT JOINT C ART
[2]  
[Anonymous], P 7 INT C PATT REC M
[3]  
APPIANI A, 2002, INT J DOC ANAL RECOG, V4, P69
[4]  
Bengio Y., 1995, Advances in Neural Information Processing Systems 7, P427
[5]  
BRUGGER R, 1997, P INT C DOC AN REC
[6]  
Cesarini F., 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318), P563, DOI 10.1109/ICDAR.1999.791850
[7]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]  
Dengel A., 1993, Proceedings of the Second International Conference on Document Analysis and Recognition (Cat. No.93TH0578-5), P86, DOI 10.1109/ICDAR.1993.395776
[9]  
Dengel A., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P587, DOI 10.1109/ICDAR.1995.601965
[10]   A general framework for adaptive processing of data structures [J].
Frasconi, P ;
Gori, M ;
Sperduti, A .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (05) :768-786